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PREFACE 



The present volume is the second in the author’s series of 
three dealing with abstract algebra. For an understanding of 
this volume a certain familiarity with the basic concepts treated 
in Volume I: groups, rings, fields, homomorphisms, is presup- 
posed. However, we have tried to make this account of linear 
algebra independent of a detailed knowledge of our first volume. 
References to specific results are given occasionally but some of 
the fundamental concepts needed have been treated again. In 
short, it is hoped that this volume can be read with complete 
understanding by any student who is mathematically sufficiently 
mature and who has a familiarity with the standard notions of 
modern algebra. 

Our point of view in the present volume is basically the abstract 
conceptual one. However, from time to time we have deviated 
somewhat from this. Occasionally formal calculational methods 
yield sharper results. Moreover, the results of linear algebra are 
not an end in themselves but are essential tools for use in other 
branches of mathematics and its applications. It is therefore 
useful to have at hand methods which are constructive and which 
can be applied in numerical problems. These methods sometimes 
necessitate a somewhat lengthier discussion but we have felt that 
their presentation is justified on the grounds indicated. A stu- 
dent well versed in abstract algebra will undoubtedly observe 
short cuts. Some of these have been indicated in footnotes. 

We have included a large number of exercises in the text. 
Many of these are simple numerical illustrations of the theory. 
Others should be difficult enough to test the better students. At 
any rate a diligent study of these is essential for a thorough un- 
derstanding of the text. 
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from the advice and criticism of many friends. Thanks are par- 
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proofs and for last minute suggestions for improvements of the 
text. 
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Chapter I 



FINITE DIMENSIONAL VECTOR SPACES 



In three-dimensional analytic geometry, vectors are defined geo- 
metrically. The definition need not be recalled here. The im- 
portant fact from the algebraic point of view is that a vector v 
is completely determined by its three coordinates (£, 77, f) (rela- 
tive to a definite coordinate system). It is customary to indi- 
cate this by writing v = (£, 77, f), meaning thereby that o is the 
vector whose x- y y- y and z-coordinates are, respectively, £, 77, and 
f. Conversely, any ordered triple of real numbers (£, 77, £) de- 
termines a definite vector. Thus there is a 1-1 correspondence 
between vectors in 3-space and ordered triples of real numbers. 

There are three fundamental operations on vectors in geometry: 
addition of vectors, multiplication of vectors by scalars (numbers) 
and the scalar product of vectors. Again, we need not recall the 
geometric definitions of these compositions. It will suffice for our 
purposes to describe the algebraic processes on the triples that 
correspond to these geometric operations. If v = (£, rj y f) and 
v' = (£', v'> then the sum 

v + v f = (£ + v + v'> f + r'). 

The product pv of the vector v by the real number p is the vector 

pv = (pf, pv> pD 

and the scalar product (v y v f ) of v and v r is the real number 
(v y v ') = ££' + 7717' + ff'. 

A substantial part of analytic geometry — the theory of linear 
dependence and of linear transformations — depends only on the 

1 
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first two of these concepts. It is this part (in a generalized form) 
which constitutes the main topic of discussion in these Lectures. 
The concept of scalar product is a metric one, and this will be 
relegated to a relatively minor role in our discussion. 

The study of vectors relative to addition and multiplication 
by numbers can be generalized in two directions. First, it is not 
necessary to restrict oneself to the consideration of triples; in- 
stead, one may consider ^-tuples for any positive integer n. 
Second, it is not necessary to assume that the coordinates f, 77, 

• • • are real numbers. To insure the validity of the theory of 
linear dependence we need suppose only that it is possible to 
perform rational operations. Thus any field can be used in place 
of the field of real numbers. It is fairly easy to go one step fur- 
ther, namely, to drop the assumption of commutativity of the 
basic number system. 

We therefore begin our discussion with a given division ring A. 
For example, A may be taken to be any one of the following sys- 
tems: 1) the field of real numbers, 2) the field of complex num- 
bers, 3) the field of rational numbers, 4) the field of residues 
modulo p , or 5) the division ring of real quaternions. 

Let n be a fixed positive integer and let A (n) denote the to- 
tality of ^-tuples (£i, % 2 , • • •, f n ) with the & in A. We call these 
^-tuples vectors , and we call A (n) the vector space of n-tuples over 
A. If jy = ( 771 , 772 , • • •, 77 n ), we regard x = y if and only if & = ru 
for / = 1, 2, • • •, n. Following the pattern of the three-dimen- 
sional real case, we introduce two compositions in A (n) : addition 
of vectors and multiplication of vectors by elements of A. First, 
if x and y are arbitrary vectors, we define their sum x + y to be 
the vector 

x + y = (£l + r]iy £2 + 772, • * £n + V n). 

As regards to multiplication by elements of A there are two possi- 
bilities: left multiplication defined by 

px = (p£ 1, p£ 2 > • • • , p£n) 
and right multiplication defined by 



Xp = (£1 P, £2 P, • y £np). 
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Either of these can be used. Parallel theories will result from the 
two choices. In the sequel we give preference to left multiplica- 
tion. It goes without saying that all of our results may be trans- 
ferred to results on right multiplication. 

The first eight chapters of this volume will be devoted to the 
study of the systems A (n) relative to the compositions we have 
just defined. The treatment which we shall give will be an axio- 
matic one in the sense that our results will all be derived from a 
list of simple properties of the systems A (n) that will serve as 
axioms. These axioms define the concept of a finite dimensional 
( abstract ) vector space and the systems A (n) are instances of such 
spaces. Moreover, as we shall see, any other instance of a finite 
dimensional vector space is essentially equivalent to one of the 
systems A (n) . 

Thus the shift to the axiomatic point of view is not motivated 
by the desire to gain generality. Its purposes are rather to clar- 
ify the discussion by focusing attention on the essential proper- 
ties of our systems, and to make it easier to apply the results to 
other concrete instances. Finally, the broadening of the point 
of view leads naturally to the consideration of other, more gen- 
eral, concepts which will be useful in studying vector spaces. 
The most important of these is the concept of a module which 
will be our main tool in the theory of a single linear transforma- 
tion (Chapter III). In order to prepare the ground for this ap- 
plication we shall consider this concept from the beginning of our 
discussion. 

The present chapter will be devoted to laying the foundations 
of the theory of vector spaces. The principal concepts that we 
shall consider are those of basis, linear dependence, subspace, 
factor space and the lattice of subspaces. 

1. Abstract vector spaces. We now list the properties of the 
compositions in A (n) from which the whole theory of these sys- 
tems will be derived. These are as follows: 

A1 (x + y) + z = * + (y + z). 

A2 x + y = y + x. 

A3 There exists an element 0 such that x + 0 = x for all *. 
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A4 

51 

52 

53 

54 
F 



For any vector x there exists a vector — x such that x + 

{—X s ) = 0 . 

a(x + y) = ax + ay. 

{a + fi)x = ax + fix. 

(afi)x = a( fix). 

\x = X. 

There exist a finite number of vectors e x , e 2 , • • •, e n such 
that every vector can be written in one and only one way 
in the form £i?i + £ 2^2 + • • ■ + £ re ^V 



The verifications of Al, A2, S1-S4 are immediate. We can 
prove A3 by observing that (0, 0, • • • , 0) has the required prop- 
erty and A4 by noting that, if x = (£ 1} • • •, £„), then we can 
take —x= ( — £ 1 , • • •, — £„). To prove F we choose for 



1 

(1) ei = (0, 0, • • •, 0, 1, 0, • • •, 0), i = 1, 2, •••,». 

n 

Then £*a has & in its /th place, 0’s elsewhere. Hence ^ £*■ a = 

i 

(£i> * * •, fn)- Hence if x = (£ 1 , • • •, £n)> then x can be 

written as the “linear combination ,, 2 £*a of the vectors e Also 
our relation shows that, if 2 £*a = 2aA> then (£ x , f 2 , * * * , £ n ) = 
(?7i> 12 , m • Vn) so that & = r}i for i = 1, 2, • • - y n. This is what 
is meant by the uniqueness assertion in F. 

The properties A1-A4 state that A (n) is a commutative group 
under the composition of addition. The properties S1-S4 are 
properties of the multiplication by elements of A and relations 
between this composition and the addition composition. Prop- 
erty F is the fundamental finiteness condition. 

We shall now use these properties to define an abstract vector 
space . By this we mean a system consisting of 1) a commutative 
group (composition written as +), 2) a division ring A, 3) a 
function defined for all the pairs (p, x ), p in A, x in 9t, having values 
px in dt such that S1-S4 hold. In analogy with the geometric 
case of ^-tuples we call the elements of 9? vectors and the elements 
of A scalars . In our discussion the emphasis will usually be placed 
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on 9t For this reason we shall also refer to 9i somewhat inex- 
actly as a ‘Vector space over the division ring A.” (Strictly 
speaking dt is only the group part of the vector space.) If F holds 
in addition to the other assumptions, then we say that 9? is finite 
dimensional , or that 9 1 possesses a finite basis over A. 

The system consisting of A (n) , A, and the multiplication px de- 
fined above is an example of a finite dimensional vector space. 
We shall describe next a situation in the theory of rings which 
gives rise to vector spaces. Let 9£ be an arbitrary ring with an 
identity element 1 and suppose that 9 1 contains a division sub- 
ring A that contains 1. For the product px, p in A, and x in 9f 
we take the ring product px. Then S1-S3 are consequences of 
the distributive and associative laws of multiplication, and S4 
holds since the identity element of A is the identity of 9t. Hence 
the additive group 9J, the division ring A and the multiplication 
px constitute a vector space. This space may or may not be 
finite dimensional. For example, if 9? is the field of complex 
numbers and A is the subfield of real numbers, then 91 is finite 
dimensional; for any complex number can be written in one and 
only one way as £ + 77V— 1 in terms of the “vectors” 1, ^ — 1. 
Another example of this type is 9£ = A[X], the polynomial ring 
in the transcendental element (indeterminate) X with coefficients 
in the division ring A. We shall see that this vector space is not 
finite dimensional (see Exercise 1, p. 13). Similarly we can re- 
gard the polynomial ring A[Xi, X 2 , • • •, X r ] where the X* are alge- 
braically independent (independent indeterminates) as a vector 
space over A. 

Other examples of vector spaces can be obtained as subspaces 
of the spaces defined thus far. Let 9i be any vector space over 
A and let © be a subset of 9J that is a subgroup and that is closed 
under multiplication by elements of A. By this we mean that if 
ye© and p is arbitrary in A then py e ©. Then it is clear that 
the trio consisting of ©, A and the multiplication py is a vector 
space; for, since S1-S4 hold in 9t, it is obvious that they hold 
also in the subset ©. We call this a subspace of the given vector 
space, and also we shall call © a subspace of 9i. As an example, 
let 9? = A[X] and let © be the subset of polynomials of degree 
<n. It is immediate that © is a subspace. Moreover, it is 
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finite dimensional since any polynomial of degree <n can be ex- 
pressed in one and only one way as a linear combination of the 
polynomials 1, X, 



EXERCISE 

1. Show that the totality © of homogeneous quadratic polynomials 
2 an in A, is a finite dimensional subspace of A[Ai, X 2 ]. 

i 



2. Right vector spaces. As we have pointed out at the begin- 
ning the system A (n) of ^-tuples can also be studied relative to 
addition and to right multiplication by scalars. This leads us to 
define the concept of a right vector space . By this we mean a 
system consisting of a commutative group 9F, a division ring A 



and a 


function of pairs (p, x‘ 


'), p in A, x' in 


di', having values x'p 


in dt' 


and satisfying: 






S'l 


( x ' + y')a 


= x'a + y'a. 




S'2 


x'(a + jS) 


= x'a + x'fi. 




S'3 


■v'(a/3 ) 


= ( x'a) 13 . 




S'4 


x'l 


= x' for all x' 


in SR'. 



Obviously the theory based on this definition will parallel that 
of left vector spaces. It should be noted, however, that a right 
space over A cannot be regarded as a left space over A if this 
division ring is not commutative. For if we write ax' for x'a y 
then we have by S'3 

(a(3)x f = x'{afS) = (x'a)/3 = fi(ax'). 

Hence S3: (fia)x f = /3(a#') holds only if 

[(<*$) — Q3a)]x' = 0 

for all x'. This together with S4 implies that a/3 = /3a for all a, /3. 

On the other hand, let A' be a division ring anti-isomorphic to 
A and let a — > a be any anti-isomorphism of A onto A'. Then 
if 9t' is a right vector space over A, 9t' may be considered a left 
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vector space over A'. This can be done by defining a!x r to be 
x'a. Then 



(a& f )x f = ($a)'x f = x'(/3a) = ( x'(3)a = (P'x')a = a (fix'). 



so that S3 is now satisfied. The verification of the other rules is 
also immediate. 

3. o-modules. Before embarking on the systematic study of 
finite dimensional vector spaces we shall consider briefly the gen- 
eralization to modules which will be very useful later on. This 
generalization is obtained by replacing in our definition the divi- 
sion ring A by any ring o that has an identity. Thus we define a 
(left) o- module to be a system consisting of a commutative group 
9?, a ring o with an identity and a function of pairs (p, x)> p in o 
and x in 9t with values px in 9t satisfying S1-S4.* It is evident 
from our definitions that a vector space is simply a A-module 
where A is a division ring. 

Besides the special case of a vector space we note the following 
important instance of an o-module: Let 9t be any commutative 
group written additively and let o be the ring of integers. If 
x e 9? and a e o, we define 



ax = 



x + x + • • • + x y 
0 if a = 0 



a times if a > 0 



— (# + #+•••+#)> —a: times if a < 0. 



Then S1-S4 are the well-known laws of multiples in 9 X. 

We note also that any ring with an identity o can be regarded 
as an o-module. As the group part 3t we take the additive group 
of o and we define ax for a in o and x in 9? to be the ring product. 
Properties S1-S4 are immediate consequences of the associative, 
distributive and identity laws for multiplication. 

As in the case of vector spaces a subset © of a module 9 1 de- 
termines a submodule if © is a subgroup of 91 that is closed rela- 
tive to the multiplication by arbitrary elements of o. Now let 

* This definition is a slight departure from the usual one in which 0 need not have an 
identity and only S1-S3 are assumed. We make this change here since we shall be in- 
terested only in rings with identities in this volume. Right 0 -; modules are obtained in the 
obvious way by replacing S1-S4 by S'l-S'4. 
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S = (x a ) be an arbitrary subset of 9? and let [S] denote the to- 
tality of sums of the form 

(2) £lX ai + £ 2 # a2 H b %rnX am 

where the are arbitrary in o and the x ai are arbitrary in S. We 
assert that [S] is a submodule. Clearly [6*] is closed under addi- 
tion and under multiplication by elements of o. Also it is easy 
to see (Exercise 1, below) that Ox = 0 and ( — £)# = — hold 
in any module, and these imply that [6*] contains 0 and the nega- 
tive of any element in [S]. Hence [S] is a submodule of 9f. We 
note also that [S] contains the elements x a = lx a of S and that 
[S] is contained in every submodule of 9t that contains S. Be- 
cause of these properties we shall say that [S] is the submodule 
generated by the set S. 

If [S] = 9?, then the set S is said to be a set of generators for 
9?. If 9? = [eiy e 2y • • •, e n ] for some finite set S = (e ly e 2 , • • •, e n ) y 
then we say that 9i is finitely generated . If there exists a set of 
generators S such that every x can be written in one and only 
one way in the form e ai in S y then 91 is called a free module 

and the set S is called a basis . Thus condition F states that a 
finite dimensional vector space is a free A-module with a finite 
basis. 

It is easy to construct, for any n y a free o-module with n base 
elements. The construction is the same as that of A (n) . We let 
o (n) denote the totality of ^-tuples (£ x , £ 2 > * * •> £ n ) with compo- 
nents & in o. Addition and multiplication by elements of o are 
defined as before. If the ei are defined by (1), it can be seen as 
in the case of A (n) that these elements serve as a basis for o (n) . 

We consider now the fundamental concept of equivalence for 
o-modules. Let 9? and be two o-modules defined with respect 
to the same ring o. We shall say that 9? and are o -isomorphic 

or simply equivalent if there is a 1-1 correspondence, x — » x of 

onto such that 

(3) x + y = x + J y ax = ax. 

Thus x — > x is an isomorphism between the groups 9t and 
satisfying ax = ax for all a and x . Such a mapping will be called 
an o- isomorphism or an equivalence . 
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If x = then by (3) x = Zatfi = Za^i = Za^. Hence 

if the elements ei are generators for 9?, then the corresponding 
elements ei are generators for If Za^i = 2/3^-, then Za^i = 
2j 6iei. It follows from this that, if 9? is a free module with basis 
then is free with basis These remarks illustrate the gen- 
eral principle that equivalent modules have the same properties, 
and need not be distinguished in our discussion. 

Suppose now that 9i and are two free o-modules and sup- 
pose that both of these modules have bases of n elements. Let 
the basis for 9i be e ly e 2y • • •, e n and that for $ be e u e 2y • • •, e n . 
Then if x is any element of 9J, we write x = 2£*a*, and we asso- 
ciate with this element the element x = 2£^ of 9L Since the ei 
and the are bases, this correspondence is 1-1 of 9? onto 
Moreover, ify = Zrjiei, theny = Zrifi while* + y = 2(& + Vi)^i 
and 

X ~\ ~ y ~ V i)€% = ’ == % T" y • 

Also — 

ax = 2 = aZ&ei = ax. 

Hence 9T and are equivalent. This proves the following 

Theorem 1. Any two free o- modules which have bases of n ele- 
ments are equivalent. 

In particular we see that any finite dimensional vector space 
with a basis of n elements is equivalent to the space A (n) of 77- 
tuples. This substantiates the assertion made before that the 
study of finite dimensional vector spaces is equivalent to the 
study of the concrete systems A (n) .* 

EXERCISES 

1. Prove the following rules for any O-module: 1) a:0 = 0, 2) a(— x) = —ax, 
3) Ox = 0, 4) (— a)x = —ax. 

2. Show that any subset of an 0-module which is closed relative to addition 
and to multiplication by elements of 0 is a submodule. 

3. If 9J is a vector space, then ax = 0 only if a = 0 or = 0. 

4. Linear dependence. From now on, unless otherwise stated, 
9t will be a finite dimensional vector space over A with basis e iy 

* A fuller account of the theory of modules can be found in Chapter VI of Volume I of 
these Lectures. However, the present discussion should be adequate for our purposes. 
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e 2y • • e n . It is easy to see that this basis is not uniquely de- 
termined. For example, the set + e 2y e 2y £ 3 , • • •, e n is a second 
basis and, if a ^ 0, the set ae iy e 2 , • • •, e n is also a basis. A fun- 
damental theorem we shall prove in the next section is that the 
number of vectors in any basis is the same. Hence the number n y 
which we shall call the dimensionality of 9? over A, is an invariant. 
As a necessary preliminary to the proof of this theorem we in- 
vestigate now the fundamental concept of linear dependence of 
vectors. 

We say that a vector x is linearly dependent on a set of vectors 

m 

S if x e [S], This is equivalent to saying that x — ^ for 

1 

suitable & in A and suitable Xi in S. This proves the first of the 
following obvious properties of linear dependence: 1) If x is lin- 
early dependent on a set S , then x is linearly dependent on a 
finite subset of S; 2) x is linearly dependent on the set S = (x); 
3) if x is linearly dependent on S and T is a set containing S> 
then ^ is linearly dependent on T; 4) if x is linearly dependent on 
S and if every x a e S is linearly dependent on the set T y then x is 
linearly dependent on T . 

The vectors x u x 2y • • *, x m are linearly dependent if there exist 
not all 0 in A such that p x Xi + P 2 X 2 + • • • + P m Xm = 0. Since 
/3x = 0 if and only if either /3 = 0 or x = 0, a set consisting of a 
single vector x is linearly dependent if and only if x = 0. If 
m > 0 and the Xi are linearly dependent, then we can suppose 
that, say, 5 ^ 0. Then 

m — 1 

x m = —/3m -1 E = 2( — /3m -1 )&-*y 
1 

so that is linearly dependent on (#i, x 2y ••*, Con- 
versely if x m is linearly dependent on (x ly x 2y • • •, then 

the vectors x ly x 2y • • *, x m are linearly dependent. Thus a set of 
more than one vector is a linearly dependent set if and only if 
one of the vectors in the set is linearly dependent on the remain- 
ing ones. If r < m and * 1 , • • •, x r is a dependent set, then so is 

r m 

x u • • •, x m ; for if £ = 0, then X = 0 if we take /3 r+1 

1 1 

= • • • = /3m = 0. 
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If Xi y • • *, x m are not linearly dependent, then these vectors 
are said to be linearly independent . The last property noted for 
dependent sets may also be stated in the following way: Any 
non-vacuous subset of a linearly independent set is a linearly in- 
dependent set. In particular, every vector in a linearly independ- 
ent set must be 7^ 0. 

The following property will be used a number of times. Hence 
we state it formally as 

Lemma 1. If x 1, x 2y * • *, x m are linearly independent and x Xy x 2 , 
• • *, x my x m+ i are linearly dependent , then x m+ i is linearly depend- 
ent on x u • • *, x m . 

Proof. We have PiXi + p 2 x 2 + • • • + + p m +ix m+ i = 0 

where some p k 5* 0. If P m +i = 0, this implies that x u • • •, x m 
are linearly dependent contrary to assumption. Hence P m +i 9^ 0. 
We may therefore solve for x m+ i obtaining an expression for it 
in terms of x iy • • •, x m . 

We shall also require the following 

Lemma 2. Let x iy x 2y • • *, x m be a set of m > 1 vectors and de- 
fine x/ = Xifor i = 1 , 2 , • • *, m — 1 and xj = x m + px 1 . Then 
the Xi are linearly independent if and only if the x/ are linearly in- 
dependent . 

Proof. Suppose that the Xi are linearly independent and let \ ft* 
be elements of A such that 2j Qix/ = 0. Then 

Pl X l + $2^2 + ‘ ‘ * + fim—lXm — l + Pm(x m + pX 1 ) = 0 

so that 

(^1 PmP)X 1 T" @2X2 "T * * ' Pm^m = 0. 

Hence Pi + P m p = P 2 = ■ * * = Pm = 0, and this implies that all 
the Pi = 0. This proves that the x/ are linearly independent. 
Now Xi = x/ for i = 1, 2, • • •, m — 1 and* w = x m f — pxf; hence 
the relation between the two sets of vectors is a symmetric one. 
We can conclude therefore that if the x/ are linearly independ- 
ent, then so are the X{. 

Evidently we can generalize this lemma to prove that the two 
sets x u x 2y • • *, x m and xf y x 2 ' y • • *, x m f where xf — Xi and x/ 
= Xj + PjX iy j = 2, • • •, m are either both dependent or both in- 
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dependent; for we can obtain the second set from the first by a 
sequence of replacements of the type given in Lemma 2. 

We come now to one of the fundamental results of the theory 
of vector spaces. 

Theorem 2. If $1 has a basis of n vectors , then any n + 1 vec- 
tors in 9t are linearly dependent . 

Proof. We prove the theorem by induction on n . Let e ly e 2y 
••*,^bea basis and let x ly x 2y • • •, x n+ i be vectors in 9?. The 
theorem is clear for n = 1 ; for, in this case, x x = a x e ly x 2 = a 2 e x 
and either x x = 0 or x 2 = a 2 ai~ l x x , We assume now that the 
result has already been established for spaces that have bases of 
n — 1 vectors. Suppose that the vectors x u x 2y • • •, x n +i are 
linearly independent and let 

**1 = Oi\\e\ + OL\ 2 e 2 + * * * + OL\nCn 

, x 2 — a 2 \e\ + a 22 e 2 + • * • + a 2 n e n 

( 4 ) 



%n + 1 — + cx. n _|_i t2 e 2 + • • • + OL n+ \, n e n 

be the expressions for the ^’s in terms of the basis. Now we may 
assume Xi ^ 0. Hence we can suppose that one of the an, say 
a lny is ^ 0. Then the set xf, x 2 ' y • • •, x n+ f where xf = x x and 
x / = Xj — oLjnOtin^Xi, j > 1, is a linearly independent set. It 
follows that the vectors x 2 \ xf y • • *, x n +i r are linearly independ- 
ent. But by (4) these x/ do not involve e ny that is, x/ e © = 
[eiy e 2y • • •, d’n-i]. Since the ^*, i < n — 1, form a basis for ©, 
this contradicts the fact that the theorem holds for n — 1, and 
the proof is complete. 

Remarks . 1) Since any non-vacuous subset of a linearly inde- 

pendent set of vectors is a linearly independent set, Theorem 2 
evidently implies that any r > n vectors in a space with a basis 
of n vectors are linearly dependent. 

2) Let S be a set of vectors and let x iy x 2y • • •, x r be linearly 
independent vectors in S . Either every set (#i, x 2y • • •, x ry x) y 
x in S y is linearly dependent or there exists an x r+ i e S such that 
Oi, x 2y •••, x r -f-i) is independent. Similarly either every set 
(* 1 , x 2y • • •, x r +i y x) y x in S y is dependent or there is an x r+2 in S 
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such that (x ly x 2y •• *, x r+2 ) is independent. After a finite num- 
ber of steps we obtain (# 1 , x 2y • • *, # m ), in S y a linearly inde- 
pendent subset of S such that any larger subset of S is linearly 
dependent. Thus any linearly independent subset of a set of 
vectors S can be imbedded in a maximal linearly independent 
subset of S. 

3) The method of proof of Theorem 2 can be used to test a 
given finite set x ly x 2y • • •, x m for linear dependence. If x x = 0 
the set is certainly dependent. Otherwise, we can replace this 
set by x u x 2y • • *, x m ' where Xi involves, say e n , but the x/ do 
not, and such that the second set is linearly independent if and 
only if the original set is linearly independent. Now it is easy 
to see that since Xi involves e n while the x/ do not, then x u x 2y 
• • •, x m ' is linearly independent if and only if x 2y x 3 ' y • • *, x m f is 
linearly independent. This reduces the problem to one of test- 
ing m — 1 vectors in a space with a basis of n — 1 vectors. 

EXERCISES 

1. Prove that the vector space A[A] of polynomials in X is infinite dimensional. 

2. Test for linear dependence: 

(a) (2, -5,2, -3), (-1, -3,3, -1), (1, 1, -1,0), (-1, 1,0, 1) 

(b) (2, -3, 0, 4), (6, -7, -4, 10), (0, -1, 2, 1) 

(c) (1, 1, 1, 1), (1, 2, 3, 4), (1, 4, 9, 16), (1, 8, 27, 64). 

n 

3. Show that the vectors Xi = X) a *fih * = b 2, • • •, m> are linearly dependent 

j = 1 

if and only if the system of equations 

£l<*ll + ^2«21 + * * • + £m«ml = 0 

£l«12 + £2^*22 + ' * * + £m«m2 = 0 



£l<*ln + ?2«2 n + ‘ * ' + £m«mn — 0 

has a non-trivial solution (£ 1 , £ 2 , • • £m) = (0i,jft 2 , ‘ * ->Pm) (0, 0, • • •, 0). 

Use this to prove that any system (5) whose coefficients are in a division ring 
A has a non-trivial solution in A, provided the number m of unknowns exceeds 
the number n of equations. 

(A similar result can be proved for “right-handed” systems = 0 by 

using right vector spaces.) 

5. Invariance of dimensionality. A set of vectors (/) has been 
called a set of generators for 9? if every x can be expressed in the 
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form 2&/»- for suitable /*■ in (/) and suitable & in A. If e ly . 2 , 

• • •, e n is a basis, these elements are, of course, generators. More- 
over, they are linearly independent; for if 2/3*.* = 0, then 

jSi^i + jS 2 . 2 + * • * + = 0.1 + 0.2 + * * * + 0. n . 

Hence by the uniqueness of the representation, each = 0. 
Conversely, any finite set of generators /i,/ 2 , •••,/»» which are 
linearly independent form a basis. Thus if 2£/* = 2ry/ t -, then 
2(& — i7»)/i = 0. Hence £* — Vz = 0 and & = rn for / = 1, 2, 

• • - y m. It follows from Theorem 2 that the number w of vec- 
tors in any basis /i,/ 2 , • • *,/ m does not exceed w. By reversing 
the roles of the e s and the /’s, we obtain n < m. Hence m = n. 
This proves the following fundamental 

Theorem 3. Any basis of 9? contains n vectors . 

The number n of elements in a basis is therefore uniquely de- 
termined. We shall call this number the dimensionality of 9 X 
over A. 

We have seen that if SR and SR are equivalent free o-modules, 
then any basis e u e 2y • • •, e n for SR yields a basis e ly e 2 , • • •, e n 
for jR. It follows that equivalent vector spaces have the same 
dimensionality. In particular we see that the spaces A (m) and 
A (n) are not equivalent \i m ^ n. 

We prove next the following 

Theorem 4. If /i, f 2y • • • , f r are linearly independent , then we 
can supplement these vectors with n — r vectors chosen from a basis 
e u e 2 > • • • , e n to obtain a basis . 

Proof. We consider the set (/i,/ 2 , •••,/»■; e n ) y and 

we choose in this set a maximum linearly independent set (f ly 
/ 2 , • • - ,/ r ; * <2 , • * •, including the/*. If we add any of the 

.’s to this set, we obtain a dependent set. Hence by Lemma 1 
of § 4 every .»■ is linearly dependent on the set (/i, • • •, f r ; e iiy 

• • •, e ih ). Hence any x is dependent on this set, and the set is a 
basis. 

The number h of .’s that are added is, of course, n — r. In 
particular we see that, if r = w, /A.w the fi constitute a basis . 
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Suppose next that the vectors /i, f 2y • • • , f m are generators. 
We select from this set a maximal linearly independent subset, 
and we assume that the notation has been chosen so that /i,/ 2 , 

• • • , f r is such a subset. Then for any /, (/i, / 2 , • • • , f ry /,•) is a 
linearly dependent set. Hence /*• and consequently every x is 
linearly dependent on /i,/ 2 , • • - ,/ r . The latter set is therefore a 
basis, and, by Theorem 3, r = n. Thus we see that any set of 
generators contains at least n elements and contains a subset of n 
elements that forms a basis . 



EXERCISES 

1. If fi = (1, —1, 2 , 3) and /2 = (3, 0 , 4, — 2 ), find vectors fz and /4 so that 
/b/ 2 ,/ 3,/4 is a basis. 

2 . Find a maximum linearly independent subset in the following set of vectors: 

(2, -3, 0, 4), (-1, f, 0, -2), (1, -1, 2, 1), (6, -7, 8, 8). 

3. Prove that any finitely generated A-module, A a division ring, is a finite 
dimensional vector space. 

6. Bases and matrices. In considering finite sets of vectors, 
we shall now regard the order of these vectors as material. Thus 
we consider ordered sets. In particular we distinguish between 
the basis e\ y e 2y • • •, e n and the basis e iiy e hy • • •, e in where the i’s 
form a permutation of 1, 2, • • •, n. Let (e iy e 2y • • •, e n ) be a par- 
ticular ordered set which forms a basis and let (x u x 2y • • •, x r ) 

n 

be an ordered set of arbitrary vectors. We write #»• = “iA'j 

1 

2 = 1, 2, r. The elements a; ; - are uniquely determined. 
Hence the matrix 







'a xl 


Oi 12 


• • • a m 


( 6 ) 


(a) = 


&21 


«22 


• • • <*2» 






■OLrl 


Oir2 


* * * «rn 



is uniquely determined by the ordered set {x Xy x 2 , • • •, x r ) and 
the ordered basis (e 1} e 2 , ■ ■ e n ). We call this matrix the matrix 
of {x x , x 2 , • • •, x r ) relative to {e u e 2} • • •, e n ). 

It will be well to recall at this point the basic facts concerning 
matrix multiplication.* Let (a) be an r X n matrix ( r rows, n 

* Cf. § 4, Chapter II of Volume I of these Lectures. 
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columns) with elements in A. As above we denote the element in 
the (*, ^-position, that is, in the intersection of the zth row and 
yth column by a, Similarly let (0) be an n X m matrix with 
elements 0jk in A. We define the product (a)(0) to be the r X m 
matrix whose element in the (/', k) position is 

0) <?ik = + OCi2&2k + ' ' ' + OLin^nk- 



If (y) is an m X q matrix with elements in A, then the products 
[(a)(/3)](y) and (a)[(/3) (7)] are defined as r X q matrices. The 
( i , /) elements of these products are respectively 

Y {(Xifiik) 7 kh Y a H (PjkJk 1 ) • 

2,h j,k 



Thus we have the associative law: [(#)($)] (y) = («)[(d)(7)]- 
If we stick to square matrices of a definite size, say n X n, 
then the product is again a matrix of the same type. Since the 
associative law holds, we can say that the totality A„ of these 
matrices is a semi-group. Also it is immediate that the matrix 



n 



1 



1 = 



0 



10 1 J 



is the identity in A n in the sense that (a)l = (a) = 1(a) for all 
(a) e A n . As usual for semi-groups we call a matrix (a) a unit 
if there exists a (0) such that (a)(0) = 1 = (0)(a). These ma- 
trices are also called non-singular or regular matrices in A,,. It 
is easy to verify that the totality of units of any semi-group with 
an identity constitutes a group.* In particular the totality 
L( A, n) of non-singular matrices is a group relative to multipli- 
cation. As in any group the inverse (0) of (a) is uniquely de- 
termined. As usual we write (0) — (a) -1 . 

We return now to the consideration of finite dimensional vec- 
tor spaces. Let (ei, e 2 , • • •, e n ) and C/1,/2, • • •, /«) be ordered 
bases for the vector space over A and, as before, let (a) be the 

* See, for example, these Lectures, Volume I, p. 24. 
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matrix of (/*•) relative to (*,•). Next let (gi y g 2y • •, g n ) be a 
third ordered basis and let 

Pll Pl2 * • • Pi n 

p21 P 22 * * * P211 

Pnl Pn2 * * * P nn 

be the matrix of (^i, g 2y • • • , £ n ) relative to {f u f 2y • • •,/»). Then 
gj = 2p jk f k . Since /* = ^,a ki e iy 

gj = 2 Pjkfk = ^Pjk&ki^i = i 
where 7# = ^ Pjkocki- This shows that the matrix of (^i, g 2y 

k 

m • - , gn) relative to (e ly e 2y • • •, e n ) is the product (P)(a) of the 
matrices Q 3 ) and (a). If, in particular, gi = ei y then ( P)(a ) is 
the matrix of the (e u e 2 , •••, *») relative to (*i, •••, e n ). 

Since ei = e% y it is evident that this matrix must be the identity 
matrix 1 . Hence (P)(a) = 1 . By reversing the roles of (e ly e 2y 
• • •, e n ) and (/ij/2, * • - we obtain also (a)(0) = 1. Thus 
we have proved 

Theorem 5 . The matrix of any ordered basis (/1, f 2y • • • , f n ) 
relative to the ordered basis (e iy e 2y • • •, e n ) is non-singular . 

Conversely, let (a) be an element of L{ A, «). Let (0) = (a) L 
Define /*• by /,• = 2 a^y. Then we assert that C/1,/2, • • *,/ n ) is 
a basis for 9 1 over A. Thus the elements 

(8) E Pkifi = E Pkidijej = E s kjej 

i i,j j 

where S k j is the Kronecker “delta” that is, 8 k j = 0 if k 7^ j and 
= 1 if k = j. Thus 2 Pkifi = € k and the e k are dependent on the 
f s. Hence every x is dependent on the f s. Thus the f's are gen- 
erators. Since their number is n y they form a basis. 

We have therefore established a 1-1 correspondence between 
the different ordered bases and the units in A n : If (e iy e 2y • • •, e n ) 
is a particular ordered basis, then every ordered basis is obtained 
by taking a unit (a) in A n and defining /< = 2 c*,-y*y. 

There is no difficulty, of course, in duplicating the above re- 
sults for right vector spaces. We need only to settle on the defi- 
nition of the matrix of (xf y x 2 ' y • • •, xf) relative to the basis 
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(ef y *2 > • • e n ') for the right space 9J'. We do this by writing 
x/ = Xe/aij and by defining the matrix of (xf y x 2y •••, xf) 
relative to (ef y e 2 r y e n f ) to be (a). Thus in this case the 
matrix is the transposed * of the matrix which appears in the 
equations x/ = As before we obtain a 1-1 correspond- 

ence between the various ordered bases and the elements of 
L{ A, n). 

EXERCISES 

1. Prove that, if (an, an, • • •, «in), (« 2 i, « 22 , • • *, a 2n ), • • («ri, a r2 , • • •, a rn ) 

are (left) linearly independent, then there exist / = r + 1, • • *, n, j — 1, 2, 
•••,», such that (a) = (a*/) is a unit. 

2. Let A be a finite division ring containing q elements. Show that the num- 
ber of units in A n is 

N = (q n — 1) (q n — q) • • • ( q n - q n ~ x ) • 

7. Applications to matrix theory. The correspondence between 
bases and units in A n enables us to apply our results on bases to 
obtain some simple but non-trivial theorems on matrices with 
elements in a division ring. We prove first the following 

Theorem 6. If ( a ) and ((3) e A n and (13) (a) = 1, then also 
(a)(0) = 1 so that (a) and (0) e L(A y n). 

Proof. If (0)(a) = 1, the equation (8) shows that if /,• = 
'Eaijej, then the e & are dependent on the /’s. The argument given 
above then shows that the f s form a basis. Hence the matrix 
(a) of (/i,/ 2 , * * - ,/ n ) relative to (*i, • • •, * n ) is a unit. Since 

the inverse is unique, it follows that (0) = (a) -1 . 

Theorem 7. 7/ (a) is not a right (left) zero divisor in A ny then 
(a) e L(A y n). 

Proof. We have to show that the vectors /*• = form a 

basis. By Theorem 4 it suffices to show that the/'s are linearly 
independent. Suppose therefore that 2/3;/; = 0. Then 20;a z -.^y 
= 0 and hence ^ = 0 for j = 1, 2, - y n. Thus if 

'01 02 * * * 0n~ 

00 ••• 0 

00 ••• 0 

* The transposed of the matrix (a i; ) is the matrix with element otij in its (J, O-Porifiom 
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then (/3)(a) = 0. Since (a) is not a right zero divisor, this im- 
plies that (/8) = 0. Hence each /3,- = 0. This proves that the 
fi are linearly independent and completes the proof for the case 
in which (a) is not a right zero divisor. The proof for the case 
(a) not a left zero divisor can be obtained in the same way by 
using right vector spaces. The details are left to the reader. 

We shall obtain next a set of generators for the group L( A, n ). 
Consider the matrices of the form 



fl 




Tpqifi) 



13 



1 



P 



1J 



fl 



P 



A>( y) = 



l • 



y 



1 



7^0 



1J 
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fl 



P 






1 



0 ••• 1 
1 



1 

1 ••• 0 



1 



p 



q 



l lJ 

in which the elements not indicated are 0 . We call these matrices 
elementary matrices of respective types I, II and III. These ma- 
trices belong to L(A y n ); for T pq (fi)~ l = T pq { — $) y D p {y)~ l = 
D p ( 7 -1 ) and P pq = P pq . We shall now prove the following 

Theorem 8. Any matrix (a) in L{A y n) is a product of elemen- 
tary matrices. 

Proof. We note first that, if (/i, /2, •••,/») is an ordered 
basis, then so are the following sets: 



{flyf^y ' * 'yfp—lyfp yfp+ly * * * ) ] n) y fp = fp “f" fif qy Q 7^ P 

^flyf^y ’ * ' y fp—ly fp yfp+ly * * ’ yfn)y fp = Iffpy Y 7^ 0 
C/lj * * ‘ yfp — ufp yfp+ly * * ' y f q—ly f q yfq+ly * * ’ yfn)y 
fp ~ fqy fq = fp • 



Moreover, the matrices of these bases relative to C/1,/2, * • -,/ n ) 
are elementary matrices of types I, II, or III. 

Now let (a) be any matrix in L( A, n) and define /*• = 
where the ^s constitute a basis for an n dimensional vector space. 
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Then (/i,/2, * * *>/n) is an ordered basis. We wish to show that 
we can go from this basis to the basis (e u e 2 , • • • , e n ) by a sequence 
of “elementary replacemen ts” of the types indicated above. 
This is trivial if n = 1 , and we can suppose that it has already 
been proved for (n — 1 ) dimensional vector spaces. Now the /*• 
cannot all belong to [e 2y e 3y • • •, * n ]. Hence one of the an, say 
c*pi, is 0 . We interchange f ly f p to obtain the basis (//, f 2y 

• • - y fp—u fp y f py * * *,/ n ) in which // has a non-zero coefficient 
for ei in its expression in terms of the Next we replace f 2 by 
f 2 * = f 2 + f 3 fi where 0 is chosen so that f 2 * s [e 2y e 3 , • • •, 

A sequence of such elementary replacements yields the basis 
•••,/«*) where the/** c [e 2 , e 3y • • •, <?»]. The vec- 
tors /2*,/3*> * * - ,/n* are linearly independent so that they con- 
stitute a basis for [e 2y e 3y • • •, * n ]- Hence by the induction as- 
sumption we can pass by a finite sequence of elementary replace- 
ments to the basis (//, e 2 , * 3 , • • •, * n ). Next we obtain (/i", e 2y 
e 3y • * *») i n which /i" = /i' + /z * 2 does not involve e 2 . A finite 

sequence of such replacements yields (ye ly e 2y • • •, £ n ) and then 
(e u • * *5 €n) • We can now conclude the proof; for the ma- 
trix (a) of the basis C/1,/2, •••,/«) relative to the basis (*1, 

• • • , £ n ) is the product of the matrices of successive bases in our 
sequence, and these are elementary matrices. 



EXERCISES 

1. Express the following matrix as a product of elementary matrices 



• 2 


-1 


1 


-i-i 


-5 


-3 


1 


1 


2 


3 


-1 


0 


--3 


-1 


0 


1. 



2. Verify that 

G i] - G !) L! G !] n a- 

Generalize this result and use the generalization to prove that the elementary 
matrices of types I and II suffice to generate L( A, n). 

3. Prove that, if 8 5 * 0, £ is a product of elementary matrices of type 

1. Hence prove that any matrix in L( A, n) has the form (J3)D n (y) where (J3) is a 
product of elementary matrices of type I and D n { y) is defined above. 
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8. Rank of a set of vectors. Determinantal rank. Let = 

( x a ) be an arbitrary subset of the vector space 9? and as before 
let [ S ] denote the subspace spanned by S . If (x ly x 2y •• •, x r ) is a 
maximal linearly independent set of vectors chosen from the set 
S y then every vector in S and hence in [S] is linearly dependent 
on the X{. Hence (#i, x 2 , • • *, x r ) is a basis for [S], The theorem 
on invariance of dimensionality now shows that r is uniquely de- 
termined by Sy that is, any two maximal linearly independent 
subsets of a set S have the same cardinal number. We call this 
number the rank of the set S . Of course, the rank r is < n and 
r = n if and only if [S] = 9?. These remarks show in particular 
that, if S = © is a subspace, then © = [S] is finite dimensional 
with dimensionality < n . Moreover dim © = n only if © = 9L 
We shall now apply the concept of rank of a set of vectors to 
the study of matrices with elements in a division ring A. Let 
(a) be an arbitrary r X n matrix with elements in A and let (e ly 
e 2y • • •, e n ) be an arbitrary ordered basis for 9?. We introduce 



the row vectors Xi = a ij e j> * = 1, 2, • • •, r, of 9? and we define 
/= i 

the row rank of (a) to be the rank of the set (x u x 2y • • •, x r ). A 
different choice of basis yields the same result. For, if (f ly / 2 , 
• • • , f n ) is a second basis for 9? (or for another ^-dimensional 
space), then the mapping is an equivalence which 

maps Xi into yi = Soj^/y. Hence dim [x u x 2y • • •, x r \ = dim [y l9 

y 2) • * ’ y jy r] • 

In a similar fashion we define the column rank of (a). Here 
we introduce a right vector space 9L of r dimensions with basis 
(ei> e 2 y • • •, e r Then we define the column rank of (a) to be 
the rank of the set (*T, x 2y • • *, x n ') where x/ = 'Le/aji. The 
x/ are called column vectors of (a). We shall prove in the next 
chapter that the two ranks of a matrix are always equal. In 
the special case A = $ a field (commutative) this equality can 
be established by showing that these ranks coincide with still an- 
other rank which can be defined in terms of determinants. 

We recall first that a minor of the matrix (a), in <£, is a de- 
terminant of a square matrix that is obtained by striking out a 
certain number of rows and columns from the matrix ( a ). For 

example, the minors of second order have the form ^ pr 



OLn 



OLa 
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We say that (a) has determinantal rank p if every (p + l)-rowed 
minor has the value 0, but there exists a p-rowed minor 0 in 
(a). The following theorem will enable us to prove the equality 
of row rank and determinantal rank. The proof will make use 
of well-known theorems on determinants. 



Theorem 9. The vectors Xi = 2a,-y*y, / = 1, 2, • • *, r, are lin- 
early independent if and only if (a) is of determinantal rank r . 



Proof. Evidently the determinantal rank p < n. Also the x’s 
are linearly independent only if r < n. Hence we may assume 
that r < n. Suppose first that the x’s are dependent, so that, 
say, *1 = /5 2 at 2 + * * * + j8 r ff r . Then = /3 2 a 2 j + + * — h 

p r a r j for j = 1, 2, • • *, n. Hence 



(«) 



13 $k<Xh 1 13 
2 2 



« 21 



<*22 



^ i fik&kn 
2 

Oi2n 



Oi r i Oir2 * * * <X rn J 



Since the first row of any r-rowed minor is a linear combination 
of the other rows, each r- rowed minor vanishes. Hence p < r. 
Conversely, suppose that p < r. It is clear that the determinan- 
tal rank is unaltered when the rows or the columns of (a) are 
permuted. Such permutations give matrices of the x’s in some 
other order relative to the ^s in some other order. Hence there 
is no loss in generality in assuming that 





"an 


a i2 


• Ip' 


0 = 


«21 


22 * 


* «2p 




^pi 


<* P 2 * ' 


a PP- 



Now let j3 *•, i = 1, 2, • • •, p + 1, be the cofactor of a< fP+ i in 



an 


Oi 12 


<*2 1 


«22 


^p-j-i,i 


^p+1,2 



* • * «i, P +i 

^2,p+l 

• • • a P +i, P +i 




24 



FINITE DIMENSIONAL VECTOR SPACES 



Then fi p+l = ^ 0 and p x a iy + j8 2 a 2 y H h j8p+ia p +i,y = 0 for 

j = 1, 2, Hence 0!#! + /3 2 tf2 + * • • + = 0 where 

jd p+ i ^ 0. Thus the x’s are dependent. This completes the proof. 

Again let r be arbitrary and assume that the vectors x ly x 2y 
• • *, x p form a basis for the set of x’s. Then by the above theo- 
rem there exists a non-vanishing p-rowed minor in the first p 
rows of (a). Moreover, since any p + 1 x’s are linearly depend- 
ent, every p + 1 -rowed minor in (a) vanishes. Hence the de- 
terminantal rank equals the row rank p. If we apply the same 
arguments to right vector spaces, we can show that the column 
rank and the determinantal rank are equal. As a consequence, 
we see that in the commutative case the two ranks (row and 
column) of a matrix are equal. 

We have seen that the matrix (a) e Z,(4>, n) if and only if the 
row vectors (x iy x 2y • • x n ) y Xi = '2a ij ej y form a basis for 9?. The 
latter condition is equivalent to the statement that the row rank 
of (a) is n. Hence the above result shows that (a) e /,(<£, n) if 
and only if the determinant of this matrix is not zero in 4>. This 
result can also be proved directly (cf. these Lectures, Volume I, 
p. 59). As a matter of fact, the inverse of (a) can be expressed 
in a simple fashion by means of determinants in the following 
way. Let Aij be the cofactor of the element ay* in (a) and set 
i Sij = Aij[ det (a)] -1 . Then (/3*y) = (a) -1 . This follows easily 
from the expansion theorems for determinants. A proof is given 
in Volume I, p. 59. 



EXERCISES 

1. Prove that if A = $ is commutative and the elements a* are all different, 
then 



1 


OL\ 


ai 2 


a\ n 1- 


1 


<X2 


Ct2 2 


••• c*2 n-1 


1 


Oi n 


Oin 2 


• • * Oin U ~ l - 



is in £(<£, n). (Hint: The determinant of this matrix is a so-called Vandermonde 
determinant. Prove that its value is life — OLj).) 

i>j 

2. Prove that if A = is a field and (a) 8 n) y then the transposed matrix 
(a)' 8 L($, n). 

3. Prove the following converse of Ex. 2: If (a)' e L( A, 2) for every (a) 8 L( A, 2), 
then A is a field. 
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4. Calculate the inverse of 

T -1 2 3- 

0 1-11 
2 1 10 * 

.3 -5 1 7. 

9. Factor spaces. Any subspace © of 9t is, of course, a sub- 
group of the additive group 9J. Since 9? is commutative, we can 
define the factor group = SR/©. The elements of this group 
are the cosets x = x + ©, and the composition in $ is given by 

x + y = x + y. 



Now let a be any element of A. Then if x = y (mod ©), that is, 
x — y = z e ©, also az e ©; hence ax = ay (mod ©). Thus the 
coset ax is uniquely determined by the coset x and by the ele- 
ment a e A. We now define this coset to be the product ax y 
and we can verify without difficulty that A and the composi- 
tion (a, x) — » ax constitute a vector space. We shall call this 
vector space the factor space of 9t relative to the subspace ©. 

Now let {fiy f 2y • • -yfr) be a basis for ©. We extend this to 
a basis C/1,/2, • * -,/r>/r+ 1> • * *>/n) for 9 f, and we shall now show 
that the cosets f r + 1 , •••,/« form a basis for $ = 9 ?/©. Let * 

n n n 

be any coset and write x — ^2 a*/,; then ^ =22 «»•/* = 23 «»/» 

i ii 

= E «*/*• = E «;/y since/, = 0 for i < r. Thus {f r +u • • -,/») 

1 r + l 

n 

is a set of generators for On the other hand, if 23 Pjfj = 0, 

r + l 

n nr 

then 23 &/?' e © and so 23 Pjfj = Z) 7kfk- This implies that all 

r + l __ r+l _ 1 

the 1 6 j = 0. Thus (/ r +i, * * *>/n) is a basis. We have therefore 
proved that the dimensionality of 9? is the difference of the di- 
mensionalities of 9land of ©. 

10. Algebra of subspaces. The totality L of subspaces of a 
vector space 9 1 over a division ring A constitutes an interesting 
type of algebraic system with respect to two compositions which 
we proceed to define. We consider first the system L relative to 
the relation of set inclusion. With respect to this relation L is a 
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partially ordered set* By this we mean that the relation ©x 3 ©2 
is defined for some pairs in L and that 

1 . © 3 

2. if ©x 3 © 2 and © 2 3 then ©1 = © 2 , 

3. if ©1 3 ©2 and © 2 2 © 3 , then ©1 2 © 3 . 

Thus the relation is reflexive, asymmetric and transitive. 

Consider now any two subspaces ©x and © 2 . The logical in- 
tersection ©x fl © 2 is also a subspace, and this space acts as a 
greatest lower bound relative to the inclusion relation. By this 
we mean that ©x fl © 2 is contained in ©1 and © 2 , and ©x fl © 2 
contains every ©' which is contained in ©x and © 2 . The set 
theoretic sum ©x U © 2 of two spaces need not be a subspace. 
As a substitute for this set we therefore take the space [©1 U © 2 ] 
spanned by the set ©x U © 2 . We denote this space by ©1 + © 2 
and we call it the join of ©x and © 2 . It has the properties of a 
least upper bound: ©1 + © 2 2 ©1 and © 2 , and ©x + ©2 is con- 
tained in every subspace © which contains ©x and © 2 . It is 
immediate that these properties characterize ©x + © 2 > that is, 
any subspace that has these properties coincides with ©x + © 2 . 
Also it is immediate from this characterization or from the defi- 
nition of ©x + ©2 as [©x U © 2 ] that this space is the set of vec- 
tors of the form y\ + y 2 where the yi e ©*. 

A partially ordered set in which any two elements have a 
greatest lower bound and a least upper bound is called a lattice . 
Hence we call L the lattice of subspaces of the space 9?. In this 
section we derive the basic properties of this lattice. First we 
note the following properties that hold in any lattice. 

1. The associative and commutative laws hold for the compo- 
sitions fl and +. 

These follow easily from the definitions. The rules for fl are, 
of course, familiar to the reader. 

We note next some special properties of the lattice L . 

2. There exists a zero element in L, that is, an element 0 such 

that ^ 

© fl 0 = 0 and © + 0 = © 

for all ©. 

* Cf. Volume I, Chapter VII, for the concepts considered in this section. 
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The subspace consisting of the 0 vector only has these proper- 
ties. Dually the whole space 3 ? acts as an “all” element in the 
sense that 

© + 9 ? = 3 ? and © 09 ? = © 

for all ©. 

The distributive law ©i D (©2 + ©3) = ©1 D @ 2 + ©1 0 ©3 
does not hold without restriction in L. For example, let X\ and 
x 2 be independent vectors and set ©1 = [vj], © 2 = [v 2 ] and ©3 
= [xi + x 2 }. Then © 2 + ©3 = [*i> *2] so that ©1 fl (© 2 + ©3) 
= ©j. On the other hand, ©1 D © 2 and ©1 fl ©3 = 0 so that 
©1 fl © 2 + ©1 D © 3 = 0 . We shall show that a certain weak- 
ening of the distributive law does hold in L. This is the follow- 
ing rule: 

3 . If ©1 2 ©2, then ©1 0 (© 2 + ©3) = ©1 fl © 2 + © x fl © 3 

= © 2 + ©1 n © 3 . 

Proof. We note first that ©1 fl @ 2 Q ©1 D (© 2 + ©3) and 
©1 H © 3 C ©x fl (© 2 + ©3). Hence 

©i n © 2 + ©x n ©3 c ©1 n (@ 2 + © 8 ). 

Next let z e ©x D (© 2 + ©3). Then z = y x in ©x and z = y 2 + 
y 3 where y 2 and y 3 are in © 2 and ©3 respectively. Hence y 3 = 
y 1 — jy2 e ©x + ©2 = ©x. Thus y 3 e ©1 fl © 3 and 2 = y 2 + y 3 e 
@2 + ©1 0 ©3- This proves that ©x ft (© 2 + ©3) Q ©2 + ©1 
D ©3. Hence 3 . holds. 

A lattice in which 3 . holds is called a modular (or Dedekind) 
lattice. We shall show next that L is a complemented lattice in 
the sense that the following property holds: 

4 . For any © in L there exists an ©* in L such that 

© + ©* = 9?, © n ©* = o. 

Proof. If (/1, / 2, •••,/,) is a basis for ©, these vectors are 
linearly independent and can therefore be supplemented by vec- 
tors / r+1 , •••,/« to give a basis (/x,/ 2 , •••,/») for 9 ?. We set 
©* = [/ r+ x,/ r+ 2, • • •,/„]. Then © + ©* = L/1,/2, • • •,/»] = 9 b 
Moreover, any vector y in © fl ©* is linearly dependent on /1, 
f 2i ■ ■ ' yfr and on/ r+ i,/ r +2, * ■ ■ , J W Since /x,/ 2 , • • • >fn &re lin- 
early independent, this implies that y = 0 . Hence © D ©* = 0 . 
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A subspace ©* satisfying the above condition is called a com- 
plement of the subspace © in 9?. We note finally that the follow- 
ing chain conditions hold in L : 

5. If ©i 2 ©2 2 • * * is an infinite descending chain of sub- 
spaces, then there exists an integer r such that © r = © r+ i = • • *. 

If ©i C © 2 C • • • is an infinite ascending chain of subspaces 
then there exists an integer r such that © r = © r +i = • • •. 

Both of these are clear since the dimensionality of a subspace 
is a non-negative integer * and since © Z) ©' implies that dim © 
> dim ©'. 



EXERCISES 

1 . Prove that, if ©i U ©2 = ©1 + ©2> then either ©1 3 ©2 or @2 2 ©1. 

2 . Prove that, if dim © = r, then the dimensionality of any complement is 
n — r. 

3 . Prove the general dimensionality relation: 

dim (©1 + ©2) = dim ©1 + dim ©2 — dim (©1 fl © 2 ). 

4 . Show that if © is any subspace 9 ^ 0 and 5^ 9 J, then © has more than one 
complement. 

11. Independent subspaces, direct sums. We consider next a 
concept which we shall see is a generalization of the notion of 
linear independence of vectors. Let © 1 , © 2 , •••, @ r be a finite 
set of subspaces of 9?. Then we say that these subspaces are in- 
dependent if 

( 9 ) ©< n (©!+■•■ + ©;_1 + ©; + ! + . . . + ©r) = 0 

for / = 1, 2, • • *, r. If Xiy x 2y • • •, x r are vectors in 9i, then 
necessary and sufficient conditions that linear independence holds 
for these are: 1) Xi 7 * 0 for / = 1, 2, • • •, r; 2) the spaces [xi] are 
independent. Thus suppose that 1) and 2) hold and let S/S*** = 0. 
Then ft#z ^ ] PjXj e \xl\ D ([#1] d~ * * * d~ \%i — 1] d~ [^z+i] d~ 

j 9±i 

+ [v r ]). Hence by 2), — ft** = 0. Since Xi 9 ^ 0, this implies 
that each ft = 0. Next assume that the Xi are linearly independ- 
ent. Then certainly each Xi 9 ^ 0. Furthermore, if x e [x^ fl 
([# 1 ] d~ * * * d” 1 P^i — 1 ] d - [^i+i] “T * * ’ d~ L^v])* then x = ft#* 

* We assign to the space 0 the dimensionality 0. 
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7: fljXj. Hence by the linear independence of the x’s, j8,- = 0 

3 & 

and so x = 0. 

Let ©i, © 2 , •••, © r be arbitrary independent subspaces and 

set © = ©i + © 2 H b © r - If y e ©, jy = jvi + y 2 H b 

jy r where jy» e ©*. We assert that this representation is unique, 
that is, if jy = y \ + y* + * • • + y r ' where y/ e ©»•, then y t = y /, 
i = 1, 2, • ■ • , r. Thus if 2y»- = 2jy/, then 2z»- = 0 for z»* = jy; — y/ 
in ©*•. Then 

— Zy 8 ©» fl (©i + * * * + ©i_i + ©i + 1 + * * * + ©r)* 

j 9±i 

Hence z* = 0 and jy t - = jy/. The converse of this result holds 
also; for if (9) fails for some /, then there is a vector z* 0 in 
this intersection. Thus z* = ^ zy, and we have two distinct 

j 5* i 

representations of this element as a sum of elements out of the 
spaces ©*;. We have therefore proved 

Theorem 10. A necessary and sufficient condition that the spaces 
©i, © 2 , • • •, © r be independent is that every vector in © = ©i + 
©2 + * * • + ©r have a unique representation in the form 2jy*, yi in 
©;. 

A second important characterization of independence of sub- 
spaces is furnished by 

Theorem 11. The spaces ©»• are independent if and only if 
dim (©i + © 2 + * * • + ©r) = 2 dim ©<. 

Proof. Suppose first that the ©*• are independent and let 
(fu,f 2 i> * * - ,/»*») be a basis for ©»■. Then if 2j8y*/yi = 0, Zy f = 0 
where jy* = X Pafa e ©». Hence for each /, 0 = jy z * = 2j8y,-/y t -. 

3 

Then j8y» = 0 since the fji for a fixed / are linearly independent. 
This proves that all the f s are linearly independent. Hence the 
f s form a basis for © = ©i + © 2 + • • • + © r - Their number 
2 ni y where ni = dim ©*, is the dimensionality of ©. Thus 
dim © = 2 dim ©*. Conversely suppose that this dimensionality 
relation holds and, as before, let the /y» form a basis for ©*. The 
number of these f s is 2 dim ©» = dim ©. On the other hand, 
these fji are generators for ©. It follows that they form a basis, 
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and consequently they are linearly independent. It follows di- 
rectly from this that, if = 2 y/, y iy y/ in ©*, then y { = y/. 
Hence the 0* are independent. 

If 9ti, $ 2 , • • •, are independent subspaces and 9? = 9ii + 
9?2 + • • * + 9? r , then we say that 9i is a direct sum of the subspaces 
9t;. We indicate this by writing 9t = 9ii © 9? 2 ® * * * © 9f r . If 
this is the case, every vector of 9t can be written in one and only 
one way as a sum of vectors in the subspace 9?;. 

EXERCISES 

1. Prove that the following are necessary and sufficient conditions that the 
subspaces 0* be independent. 

0! n 02 = 0, (0i + 0 2 ) 0 03 = 0, 

( 0 i + 02 + ©s) n ©4 = 0, • • • • 

2. Prove that if 9? = 9?i © 9?2 © * • • © 9f r and each 9?; = 9?a © • • • © 9f; ni -, 
then 9? = 9tll © • * * ® 9Jlni © 9^21 © * • • © 9?2n 2 © • • * © 9 ?rn r * 
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LINEAR TRANSFORMATIONS 



In this chapter we discuss the simplest properties of linear 
transformations and of certain algebraic systems determined by 
these mappings. Two particular types of linear transformations 
are of special interest, namely, the linear transformations of a 
vector space into itself and the linear transformations of a space 
into the one-dimensional space A. The former type constitute 
a ring while the latter, called linear functions, form a right vec- 
tor space. There is a natural way of associating with a linear 
transformation of the vector space 9?i into the vector space dt 2 
a transposed linear transformation of the conjugate space of linear 
functions on into the conjugate space of 9?i. We consider 
the properties of the transposition mapping. The relation between 
linear transformations and matrices is discussed. Also we de- 
fine rank and nullity for arbitrary linear transformations. Finally 
we study a special type of linear transformation called a projec- 
tion, and we establish a connection between transformations of 
this type and direct decompositions of the vector space. 

1. Definition and examples. The differentiation mapping 
<t>(X) — > <£'(X) in the vector space <3>[X] of polynomials with real 
coefficients has the properties 

[$(x) + my = 0 r (X) + ^'(X), [« 0 (X)]' = a0'(x). 

This is an example of a linear transformation. Another example 
is the mapping of A (3) defined by 

(£> Vy f) -> ?X + rjfi + 
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where X, n y v are fixed elements of the basic division ring A. In 
general, let 9ii and 9 1 2 be vector spaces over the same division 
ring A. Then we call a mapping A of 9f x into 9f 2 a linear trans- 
formation if 

(1) (x + y)A = xA + yA, (ax) A = a(xA) 

for all x y y in 9? i and all a in A. As usual xA denotes the image 
in 9t 2 of the element x. We shall also denote the image set, that 
is, the set of images xA, by 9 X\A. The statement that A is a 
mapping into 9J 2 allows the possibility that dliA cz 9?2 (i.e. 9f \A 
is a proper subset of 9? 2 ). 

The concept of linear transformation is a special case of that 
of o-homomorphism of one o-module 9? x into a second one. The 
generalization is obtained by replacing “a in A” by “a in o” in 
the above definition. It may be recalled that o-homomorphisms 
which are 1-1 have been introduced in Chapter I. Such mappings 
have been called equivalences or o-isomorphisms. The existence 
of an o-isomorphism of 9J X onto 9 1 2 is our criterion for equivalence 
of the modules 9f*. 

The first condition in (1) states that A is a homomorphism of 
the additive group 9? x into the additive group 9?2 while the sec- 
ond can be interpreted as a type of commutativity of A with a. 
This is strictly the case when 9? i = 9?2 = 9f; for we can introduce 
for each a the mapping ai which sends x into ax. We call ai the 
scalar multiplication determined by a. It is clear that a\ is an 
endomorphism of 9?, that is, a homomorphism of 9?, regarded as 
a group, into itself. Now x(a\A) = (ax) A and x(Aal) = a(xA)\ 
hence A is a linear transformation in the vector space 9? if and 
only if A is an endomorphism of 9i which commutes with all the 
endomorphisms a\. 

Besides the linear transformations of a vector space into itself 
a second noteworthy type of linear transformation is a linear 
function . This is defined to be a mapping x — > f(x) of a vector 
space 9f into the division ring A such that 

(2) f(x +y) =f(x) +f(y)y f(ax) = af(x). 

It is clear that a mapping of this type can be regarded as a linear 
transformation of 9? into the one-dimensional vector space A. 
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The latter is obtained by using the additive group of A as group, 
A as division ring and left multiplication a £ as multiplication by 
scalars. The element 1 (or any non-zero element) is a basis for 
A over A. The second example considered above is an instance 
of a linear function on A (3) . 

EXERCISES 

1. Show that the differentiation mapping <£(X) — > $'(A) is a linear transforma- 
tion in the vector space 9? of polynomials of degree <n. 

2. Show that the difference operator $(A) — » <£(A + 1) — <£(A) is a linear trans- 
formation in $[X]. 

2. Compositions of linear transformations. We consider now 
ways of combining linear transformations. It should be remarked 
that all the results of this section apply equally well to the more 
general case of o-homomorphisms of modules. However, for the 
sake of simplicity we shall state the results only for the special 
case that is of primary interest in the sequel. 

Suppose first that A and B are linear transformations of a vec- 
tor space 9?i into the same space 9? 2 . We define a mapping 
A + B of 3?! into 9t 2 by the equation 

(3) x(A -f- B ) = xA -f- xB 

for any x in 9ti. Thus to obtain the effect of A + B on x we add 
the images xA and xB. Clearly A + B is a (single-valued) trans- 
formation of into 9? 2 . Since 

(x + y)(A + B) = (x +y)A + (x +y)B = xA +yA + xB + yB 
= xA + xB + y A + yB — x{A + B) +y(A + B) 

and 

(ax) (A + B) = (ax)A + (ax)B = a(xA) + a(xB) 

= a(xA + xB) = a(x(A + B)) y 

A + B is a linear transformation of 9ti into 9t 2 . 

We now denote the totality of linear transformations of 9?i 
into dt 2 by £($Ri, 9 ? 2 ), and we shall show that this set, together 
with the addition composition just introduced, is a commutative 
group. We note first that the associative and commutative laws 
hold; for we have the following relations: 
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x[(A + B) + C] = x(A + B) + xC = xA + xB + xC , 

x[A + (B + C)] = xA + x(B + C) = xA + xB + xC> 

x(A + B) = xA + xB y x(B + A) = xB + xA . 

Thus {A + B) + C and + {B + C) have the same effect on 
any x in 9 t l5 and this is what is meant by saying that the trans- 
formations A + (B + C) and {A + B) + C are equal. Simi- 
larly A + B = B -f- A. Next we define the mapping 0 by the 
condition *0 = 0 , the zero vector in 9 t 2 . It is immediate that 
this mapping is in 8(9ii, 9? 2 ) and that A + 0 = A = 0 + A {or 
all A in 8(9ii, $R 2 ). Hence 0 acts as identity element for the ad- 
ditive composition. Finally, if A is any member of 8(9li, SR 2 ) 
we define —A to be the mapping such that x( — A) = —xA. It 
is easy to verify that —A e 8(9?i, 9 ? 2 ). Moreover, —A acts as 
the inverse of A since x{A + ( — A )) = xA — xA = 0 for all x. 
This completes the verification that S( 9 ?i, 9 ? 2 ), + is a commuta- 
tive group. 

We introduce next a second composition for linear transforma- 
tions. This is defined for any A in 8 ( 9 ?i, 9 t 2 ) and any B in 
8(9l 2 , $ 3 ), and it is taken to be the resultant of A followed by B . 
As usual, we denote the resultant as AB. Hence by definition 
x(AB) = ( xA)B . Consequently 

(x + y)(AB) = ((, x +y)A)B = {xA + y A) B = ( xA)B + (yA)B 
= x(AB) + y(AB) 

and 

(ax)(AB) = (( ax)A)B = ( a(xA))B = a((xA)B) = a(x(AB )). 

This shows that AB e 8(9li, 9 J 3 ). 

As is well known, the product AB is an associative one, that 
is, if A e 8 ({Ri, ^ 2 ), B e 8(9t 2 , 9? 3 ) and C e 8(0? 3 , then 

( 4 ) (AB)C = A(BC); 

for 



and 



x{{AB)C) = (x(AB))C = ((, xA)B)C 
x{A{BC)) = (xA)(BC) = {{xA)B)C. 
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We prove next the important distributive laws: If A e 8 ( 9 ii, 9 t 2 ), 
B, C e 8 ( 9 i 2 , $3) and D e 8 ( 9 i 3 > $4), then 

( 5 ) A(B + C) = AB + AC , ( B + C)D = BD + CD. 

These follow from the following equations: 

x(A(B + O) = (xA)(B + C) = (xA)B + (xA)C 

= x{AB) + x(AC) = x(AB + AC) 

x((B + C)D ) = (xB + xC)D = ( xB)D + ( xC)D 

= x(BD ) + x(CD) = x(BD + CD). 

We now specialize the foregoing results to the case of the linear 
transformations in a single vector space 9?. It is clear that 
8 = 8 ( 91 , 9t), +, • is a ring; for 8(9?, 9£), + is a commutative 
group, 8(9i, 9?) is closed under •, and this composition is associa- 
tive and distributive with respect to addition. It is evident also 
that 8 contains the identity mapping x — > x and that this map- 
ping, denoted as 1 , is the identity in the ring 8 (i.e. Al = A = \A 
for all A) A 

Suppose next that <£ is the center of the ring A. Of course, $ 
is a subfield of A. We observe that the scalar multiplications 
y 1 determined by the elements 76$ are linear transformations; 
for, 7 1 is an endomorphism and (ax)yi = y(ax) = ( ya)x = (ay)x 
= a(yx) = a(xyi). Thus if $1 denotes the set of multiplications 
by the elements 7 of <£, then 8 2 In particular if A = $ is 
commutative, then 8 contains all the scalar multiplications. We 
now show that any of the groups 8(9?i, 9? 2 ) can be regarded as a 
vector space over the field $. For this purpose we define 7 A 
for 7 in and A in 8 ( 9 ii, 9 ? 2 ) to be the mapping x — > 7 (xA) = 
(7 x)A. Since this is the resultant of A and 7 1 (in 9? 2 ) or of 7 1 
(in 9ti) with A and each of these is linear, 7 A is in 8 ( 9 ti, 9 £ 2 ). It 

* A reader familiar with the theory of endomorphisms of a commutative group such as is 
given in Volume I, pp. 78-82, will note that these results can also be obtained by the follow- 
ing reasoning: The set @ of endomorphisms of 9i, + is a ring relative to the addition com- 
position x(A + B) = xA + xB and the multiplication composition as resultant. The set 
8 = 8(91, 9$) is the subset of © of elements commuting with the scalar multiplication ai. 
Since the totality of elements of a ring commuting with the elements of a given subset 
form a subring, it is clear that 8 is a subring of ®. 
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is easy to verify that the function 7 A satisfies the rules for mul- 
tiplication by scalars in a vector space. In this way we can re- 
gard 8(9?i* 9? 2 ) as a vector space over $>. 

If we combine the results of the last two paragraphs* we see 
that the set 8 = 8(9?* 9?) is at the same time a ring and a vector 
space over a field <£. The ring addition is the same as the vector 
space addition. Moreover* we have the following relations con- 
necting multiplication and scalar multiplication: 

(6) y(AB) = (7 A)B = A{yB). 

A system having these properties is called an algebra (or hyper- 
complex number system) over the field <E>. Hence when we wish to 
study 8 relative to all three operations at the same time* we shall 
refer to this system as the algebra of linear transformations in 9?. 



EXERCISES 

1. Show that 8(9?i> 9J 2 ) is an 8(9?i, 9?i)-module relative to the composition AX , 
A in 8(9?i, 9?i), X in 8(9?i, 9? 2 ) as the resultant linear transformation. Similarly 
show that 8(9?i, 9? 2 ) can be regarded as a right 8(9? 2 , 9? 2 ) module. 

2. Prove that if ai is a linear transformation, then a is in the center of A. 

3. Verify that, if C e 8(9? 2 , 9? 2 ) and X e 8(9?i, 9? 2 ), then the mapping X — > XC 
is an 8(9? 1 , 9?i) endomorphism of 8(9?i, 9? 2 ). 

3. The matrix of a linear transformation. We shall now show 
that a linear transformation of one finite dimensional vector space 
9?i into a second finite dimensional space 9? 2 can be completely 
described by means of a finite matrix with elements in the under- 
lying division ring A. 

Let 9?i* / = 1* 2* be dimensional* let (e ly e 2 * • • •* e ni ) be an 
ordered basis for 9?i* C/1,/2, * • - ,/ n2 ) an ordered basis for 9? 2 and 
let A e 8(9?i, 9? 2 ). We note first that the action of A on any x is 
determined by the images etA, i = 1* 2* • • •* n x . Thus x can be 

ni 

written as X) Hence xA = ('Z£ i e i )A = 2 {^e t )A = '2,^ i (e i A). 
1 

Thus xA is determined by the expression for x and by the images 
eiA . Now write 

(7) e^A OLi\f\ T" ^12 f 2 H h ®-in%f ntf i = 2* * j ^1* 
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Then we obtain the matrix 





«11 


«i 2 




(8) 


«21 


a 22 


a 2n 2 






^n22 


ot-mnv 



as the matrix of {e x A y • • *, e ni A) relative to (/i, • • *,/ n2 ). Clearly 
if the two ordered bases and the matrix (8) are known, then the 
effect of A on any x can be deduced; for (7) holds and as we 
have seen xA = ^ &ayy/y. 

This connection can be expressed in terms of matrix multipli- 
cation as follows. Write x = 2£*a- as a row (id, £ 2 > • • •, £ ni ) and 
similarly y = 2^/y as (rj i, rj 2y • • *, y n2 ). Then the “row vector” 
associated with jy = xA is obtained by performing the matrix 
multiplication: 







QJll 


<*12 * ' 




(9) 


(£l> ?2j - 


• • n a21 

5 sni/ 


a 22 


Of 2n2 








«m2 


Of nin2 . 



Thus xA = y = 2r;y/y where rjj = 2&a»-y, and this is what we ob- 
tain from (9). 

It should be emphasized that the matrix (a) depends on the 
choice of bases in the two spaces. For this reason we call (a) 
the matrix of A relative to the ordered bases (e iy e 2y • • •, e ni ) and 
(/i>/ 2 > * * * ,/ n2 ). If 9?i = 9?2 = then it is natural to use just 
one ordered basis, that is, to take the /* = In this case we 
refer to (a) as the matrix of A relative to (ei, e 2y • • *, e n ). 

The result that we have obtained is that any A s 8(9ii, 9? 2 ) de- 
termines an n i X n 2 matrix with elements in A. We now note 
the converse: that any X n 2 matrix defines a linear transfor- 
mation of dti into 9t 2 . We note first that, if (e iy e 2 , • • *, e ni ) is a 
basis in 9ii and (u u u 2y • • •, u ni ) is an arbitrary ordered set of 
vectors in 9t 2 , then there exists a linear transformation A map- 
ping ei into Ui for i = 1 , 2 , • • *, n Thus consider the mapping 
2£»A* — > Since there is only one way of writing a vector 

x e 9?i as 2£*<?y, this mapping is single-valued and, since any 
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x e dli can be written in the form the mapping is defined on 

the whole of 9ti. If jy = is a second vector in 9?i, y — > 

HrjiUi and x + y = 2(£; + rji)ei — ■» 2(£* + 77*)^; = + XrjiUi. 

Hence the transformation is a homomorphism. Moreover, a# = 
= a(E%iUi) so that the mapping is linear. 
Clearly d = 1 ^ 1«»* = as required. Now let (a) be any 

n x X ^2 matrix and let C/1,/2, * • *,/ n2 ) be a basis in 9? 2 . Then we 
define / = 1, 2, • • *, Wi, and we can determine a 

linear transformation A such that dA = U{. Evidently the ma- 
trix of A relative to (e ly e 2y • • •, e ni ), (/i,/ 2 , * * - ,/ n2 ) is the given 
matrix (a). We have, therefore, shown that the correspondence 
A — > (a) is 1-1 between 8(9? 1, dt 2 ) and the set of n 1 X n 2 ma- 
trices with elements in A. 



EXERCISES 



1. Let 9J be the vector space of polynomials of degree <n with real coefficients 
and let D denote the differentiation operator. Show that D is nilpotent in the 
sense that D n = 0. Determine the matrix of D relative to (1, X, • • •, A n_1 ) and 
also relative to (1, A/1!, • • •, A n ~ l /(n — 1)!). 

2. Let 9? be as in 1. and let U be the linear operator /(A) — » /(A + 1). Prove 
that 



U 



>+Mr+- 



• + 



D n ~ 1 
(»- 1 )!’ 



3. Determine the matrix of b = U — 1 relative to the basis ( eo , <?i, • • •, e n —\ ) 
where 



eo 



A(A - 1) ■ ■ • (A - / + 1) 



4 . Let 9? be the set of complex numbers regarded as a vector space over the 
subfield of real numbers. Show that the mapping x — > x (complex conjugate) is 
linear and determine its matrix relative to the basis (1, /). 



4. Compositions of matrices. As before, let 9 U, i = 1, 2, be 
rii dimensional vector spaces over A and let (e\ y e 2y •••, e ni ) y 
(fu f2> • • • , /n 2 ) be bases for these two spaces. Let A and B be 
linear transformations of 9? 1 into 9 l 2y (a) and (]8), respectively, 
their matrices relative to the given bases. Then 

(10) e t A = Sa„/ } ', e t B = S/3, 

Hence 

( 11 ) 



ei(A + B) = etA + eiB = S(ay + /3, •,•)/). 
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This shows that the matrix of A + B is obtained from the mat- 
rices (a) and (/3) by adding elements in the same position. Ac- 
cordingly we define the sum of two X n 2 matrices (a) and (/ 3 ) 
as the matrix whose (*>./)-element is a# + jS#. It is easy to 
verify that the set of n x X n 2 matrices is a commutative group 
relative to this addition. As a matter of fact, except for a differ- 
ence of notation, this is a special case of the result noted in Chap- 
ter I that the set of ^-tuples form a group under addition as 
addition of components. The 0 matrix is the matrix that has 0 
in every place, and — (a) has the element — a# in its (/, /) posi- 
tion. Now the result that we established in (11) — namely, that 
if A —> (a) and B — » (/3) in the correspondence between linear 
transformations and matrices, then A + B — > (a) + (j3) — is 
equivalent to the statement that A — > ( a ) is a group isomor- 
phism. 

We consider next a third vector space $R 3 with basis (g ly g 2 , 
• • *, g n 3 ). Let C be a linear transformation of 9 ? 2 into 9? 3 and let 
(7) be the matrix of C relative to (/i,/ 2 , * * - (gu £2, • * •, 

g„ 3 ). Then 

(12) fjC = 2 y jk gk 

so that 

(13) ei{AC) = ( X a nfi\C = X aij(fjC) = X anyjkgh- 

This shows that the matrix of AC has the element X a */Yifc in 

3 

its (/, &)-position; hence this matrix is the product (a)(7) as de- 
fined in Chapter I. 

The associative law for matrix multiplication has been estab- 
lished before (p. 16). We now prove distributivity. The (/, k ) 
element of [(a) + (P)](y) is 

22 + Pij)yjk 

3 

while the (/, k) element of (a)(7) + (/ 3 ) (7) is 

22 a ij 7 jk + 22 fiijyjk- 

3 3 
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Hence by the distributive law in A, 

[(«) + m(y) = (a)(7) + («(7). 

Similarly we can verify that 

(«W) + (7)] = (a) 03 ) + (a)( 7 ). 

We remark also that it is easy to deduce the associative and dis- 
tributive laws for matrices from the corresponding laws for linear 
transformations (cf. Exercise 1 below). 

We note next that the set of n 1 X n 2 matrices with elements in 
A can be regarded as a vector space (or a right vector space) 
over A. This is clear since the set of matrices is essentially the 
same as the set of riin 2 - tuples over A. As before, we define p(a) 
to be the n \X n 2 matrix whose elements are p times the corre- 
sponding elements of A. Clearly the vector space that we ob- 
tain in this way is n x n 2 dimensional. If we use our correspond- 
ence between matrices and linear transformations, we can carry 
over this discussion to the set S(3Ji, 9f 2 ) of linear transformations 
of 9ti into 9f 2 « However, unless A = $ is commutative, the sca- 
lar multiplication in S(9fi, 9t 2 ) obtained in this way depends on 
the choice of the bases in 9? 1 and 9 1 2 . 

On the other hand, let A = <£. Then, as we saw in § 2, there is a 
way of defining a multiplication of linear transformations by ele- 
ments of $ which is independent of the choice of bases. The prod- 
uct 7 A y 7 in $, is taken to be the resultant 7 \A. Relative to this 
composition ?(9ii, 3? 2 ) is a vector space over <£. We now note 
that, if (<?i, e 2y * * *> <?m) is a basis for dti, then e t yi = 7^. Hence 
the matrix of 7 1 relative to this basis is the diagonal matrix 

y 0 

7 

(14) diag {73 7, •■*, 7} = 

-0 7 

Consequently if (a) is the matrix of A relative to {e Xy e 2y • • •, e ni ), 
(fiy/2y * * m yf n ^)y then y(a) is the matrix of 7 A relative to this 
pair of bases. This means that the scalar multiplication 7(a) 
corresponds to 7 A. 
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Another way of stating this result is the following. Let A — » 
(a) be the correspondence which associates with a linear trans- 
formation A e 8(SRi, $ 2 ) its matrix ( a ) relative to the bases {e u 
e 2y •••, o* (/i 5 /2> Then this correspondence is an 

equivalence of the vector space 8(SRi, SR 2 ) onto the vector space 
of n x X n 2 matrices with elements in <i>; for we have seen that 
A — > (a) is a group isomorphism and we have just verified that 
y A — > y(a). Since the matrix space is n ^-dimensional, it fol- 
lows from this that 8(SRi, SR 2 ) is ^^-dimensional. This proves 
the following 

Theorem 1. Let SR*, i = 1, 2, be rii-dimensional vector spaces 
over a field <£ and let 8(SRi, SR 2 ) be the set of linear transformations 
of SRi into dt 2 . Define A + B> and y A for y in as above . Then 
8(SRi, 9? 2 ) is an n x n 2 -dimensional vector space relative to these 
compositions . 

We return now to the case of an arbitrary A, but we specialize 
by taking 9L = dt 2 = SR. Also we take the fi = e* so that ( a ) 
is now the matrix of A relative to the single basis (e ly e 2 > • • e n ). 
In this case we have a correspondence A — » ( a ) of the ring 
8 = 8 (SR, SR) onto the set A n of n X n matrices. The addition 
and multiplication compositions introduced in A n turn this set 
into a ring. Also our results show that, if A — > (a), B — > (/3), 
then + B — > (a) + (/3) and — > (a) (13). Hence we have 
the important 

Theorem 2. Let SR be an n-dimensional vector space with the 
basis (eiy e 2 > • • *, e n ) over A. If A is a linear transformation in 
SR, we associate with A its matrix ( a ) relative to the basis (e u e 2 , • • •, 
e n ). Then A — > (a) is an isomorphism of the ring 8 of linear 
transformations in SR onto the matrix ring A n . 

EXERCISE 

1. Prove associativity and distributivity of matrix multiplication by using the 
corresponding properties of multiplication of linear transformations. 

5. Change of basis. Equivalence and similarity of matrices. 

Let (a) be the matrix of A e 8(SRi, SR 2 ) relative to the bases (e iy 
e 2 , • • •, e ni ), (/x,/ 2 , • • - We now change the bases in 9?i 
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and and we shall calculate the matrix of A relative to the new 
bases. Thus let (u u u 2y • • •, u ni ) be a second basis in 9ti where 
Ui = and let (vi y v 2y • • • > v n2 ) be a second basis in 9t 2 where 

v p = 'Zvpqfq. The matrices (m) and (p) are non-singular, and we 
write their inverses as (m) -1 = (mzj*)> (*')~ 1 = (^g*)- Now we 
have 

u%A (^jfXijCj) A Xidij(ejA) ^i\XijOijpfp 



where 



2 l^ijCX-jpVpq Vq jZjQiiqVq 

&iq = ^ > V'ij&jpVpq* • 
j,P 



Hence the new matrix of A is 



(15) (a) = (fx)(a)(p) 1 

where (m) gives the change of basis in 9?i and where (v) gives the 
change of basis in $ft 2 . 

Now we shall call two n x X n 2 matrices (a) and (d) equivalent 
{or associates) if there exist matrices {\x) and (v) in L{ A y n x ) and 
L{ A, w 2 ) respectively such that 

(16) (a) = (m)(«)W- 

Thus we see that any two matrices of the linear transformation 
A relative to different bases in the two spaces are equivalent. 
The converse is also clear. For let (a) and (d) be related as in 
(16) and let A be the linear transformation whose matrix is (a) 
relative to (e ly e 2y • • •, e ni ) and (/i,/ 2 , • • - ,/ n2 ). Then this linear 
transformation has the matrix (d) relative to (ui y u 2y • • *, « ni ), 
(w u w 2y • • •, w n2 ) where 

Ui = Wp = '2tVpq*f qy {v) = {v p q*)» 

Assume next that 9?i = 9? 2 = 5R and that {e iy e 2y • • •, e n ) is a 
basis. Let (a) be the matrix of A relative to {e ly e 2y • • •, e n ). 
Then eiA = and our computation shows that the matrix 

of A relative to {u\ y u 2y • • •, u n ) y U{ = 'Znijej is 

(d) = (m)W(m)- 1 . 



(17) 
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Two matrices in A n that are related in this way are said to be 
similar. As in the case of equivalence it is clear that two ma- 
trices in A n are similar if and only if both are matrices of the same 
linear transformation relative to (single) bases for 9? over A. 

As we shall see in the next section, it is easy to give necessary 
and sufficient conditions for equivalence of matrices. On the 
other hand, the problem of similarity requires a fairly elaborate 
analysis, which we shall undertake in the next chapter. At this 
point we illustrate the method that we shall use to handle this 
problem. 

Example. We wish to prove that the matrices 





r° 


1 


0 


... 0 




ft 


O' 




0 


0 


1 


... 0 




ft 




(a) = 


0 


0 


0 


... 1 


- 03 ) = 








A 


0 


0 


... 0- 




-0 


fn- 



where the ft are the n distinct nth. roots of unity, are similar in C n , C the field of 
complex numbers. We use (a) to determine a linear transformation A in an n 
dimensional vector space 9? over C. This is done by choosing a basis 
(eiy <? 2 , • • <?n) in 9? and defining eiA — ^ a^ej — for i < n and = e\ for 

i = n. Then in order to prove our assertion, we must find a basis («i, # 2 , • • • , u n ) 
such that UiA = # t -ft for the A that we have just defined. Without giving the 
details as to how one goes about finding such u ;, we shall show that the following 
Ui satisfy the requirements. 

Ui — e ± + ft -1 * 2 + ft~ 2 ^3 + * * * + ft -(n_1 V n . 

Thus 

UiA = e% + 3 + • • • + ft _(n— l) <?i = ft«i, 

and the U{ form a basis since the matrix of (« 1 , u 2 , • • • , u n ) relative to 
(*i, e 2 y • • •, e n ) is the Vandermonde matrix 





rl 




fl~ 2 ' 


•• fi-'—”! 


0*) = 


1 




f2- 2 • 


•• f 2 - (n - 1) 




1 


fn- 1 


fn~ 2 • 





in which the ft 1 are distinct (cf. Exercise 1, p. 24). This proves the similarity 
and shows in fact that (/3) = (/z)(a)(jLt) _1 where ( fi ) is the above matrix. 
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EXERCISES 

1. Prove that the relations of equivalence and of similarity are reflexive, sym- 
metric and transitive. 

2. Prove that, if A has characteristic 0, then the following two matrices are 
similar in A n 





rl 


1 • 


• V 




n 


0 • 


• O' 


(a) = 


1 


1 • 


• 1 


- 08) = 


0 


0 • 


• 0 




.1 


1 • 


>• 1 - 




.0 


0 • 


•• 0. 



6. Rank space and null space of a linear transformation. If 

A is a linear transformation of 9?i into 9? 2 , and © x is a subspace of 
3?!, then the image © x A consisting of all vectors of the form x x A y 
Xi in ©i, is a subspace of 9? 2 . If y i 8 ©i> then Xi + jy x e © x , 
and so Xi A + y x A = (# x + yi)A e © x yf. Also, if x x e © x , axi e 
© x ; hence a{x x A) = ( ax x )A e © x yf. If the vectors / x ,/ 2 , • • *,/ w 
are generators for © x , any Xi e © x has the form Hence any 

X\A = 2£i(fiA). Thus the image vectors f x A y f 2 A y • • • y f m A are 
generators for © x yf. If the /»• form a basis for © x , m = dim © x . 
The images fiA need not form a basis for © x ^, but since they are, 
in any case, generators, their number m > dim © x yf. Hence we 
see that the dimensionality of the image space never exceeds 
that of the original space. 

We shall call the subspace 9i x ^ of dt 2 the rank space of A\ its 
dimensionality the rank of A . If (e u e 2y • • e ni ) is a basis for 
3f x , dhA = [eiA y e 2 A , • • •, e ni A]> the space spanned by the vec- 
tors eiA . Hence the rank of A is the rank of the set {e\A y e 2 A , 

• • •, e ni A). If (/i,/ 2 , • • •,/„,) is a basis for and e { A = £ a.v/j> 

3 

i = 1, 2, • • •, w x , then the rank of the set ( e x A y e 2 A , • • •, e ni A) is 
the same as the row rank of the matrix (a) of A determined by 
the bases (e u e 2y • • •, e ni ), (/ x ,/ 2 , • * - ,/ W2 )- This proves the fol- 
lowing 

Theorem 3. The rank of a linear transformation A of a vector 
space equals the row rank of any matrix of A . 

We consider next the totality 31 of vectors z in $R X such that 
zA = 0. It is readily verified that 3t is a subspace of 3f x . We 
call it the null space of A and its dimensionality the nullity of A . 
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We prove now the main theorem relating the rank and the nul- 
lity of A. 

Theorem 4. Rank of A + nullity of A = n u the dimensionality 
of the space $Ri. 

Proof. Let (z ly z 2 , • • • , z v ) be a basis for 9L We can supple- 
ment this basis by n x — v vectors Xi to obtain the basis (x ly • • *, 
x ni - v ; z u z 2y • • •, z v ) for 9ti. The vectors 

X\Ay X 2 Ay *, X ni — V A y Z\Ay *, ZyA 

are generators for the rank space A C 9? 2 . Since the ZiA = 0, 
the vectors x x A y • • •, x ni _ v A are also generators for A. But 
these vectors are linearly independent. Thus if 

Pi(xi A) + /3 2 (x 2 A) + • • • + Pni-p( x m-p^) — 0, 

then (E(3iXi)A = 0 and e = [z ly z 2y •••, z v ]. Since the 

set (xiy • • •, x ni _ vy z u • • •, Z v ) is an independent set, this implies 
that the are all 0. Hence if we set yi = X{A y (y ly y 2y • • •, 
y ni -„) is a basis for dt x A. Thus dim $tiA = n x — v = n x — nul- 
lity of A and this proves the theorem. 

We now supplement the basis (y u y 2y • • *, y ni _„) of dt x A to a 
basis (y u y 2y • • •, y ni _„ w u w 2j •• •, w n2 _ ni+ „) for 9? 2 . Then we 
have the relations 

XiA = y h i = 1, 2, • • •, p = n x — v 
ZjA 0 , j \y • • • y V, 

These show that the matrix of A relative to the bases {x u • • • , 
Xpy Zu ' •> z v)> ( yu '--lypyW U * * *, ^n 2 - P ) ^ 

(18) diagjl, . P 7l>0, •••,()}.* 

If the matrices of these bases relative to the original bases (e X y 
e 2 , •••, e ni ) and (/i, f 2y •••,/„,) are respectively (/x) and (v)> 
then, as we have seen in the preceding section, (n)(a)(v)~~ 1 is the 
matrix (18). The number p is the row rank of the matrix (a). 
This proves the following 

* We use this notation, introduced in (14), for a matrix whose non-zero entries occur 
only in the (1,1), (2,2), etc., positions. 
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Theorem 5. If (a) is an n x X n 2 matrix with elements in a 
division ring A and if (a) has row rank p, then (a) is equivalent to 
the matrix given in equation (18). 

If (a) and (a) are equivalent matrices, then we know that these 
can be taken to be matrices of the same linear transformation A . 
The row ranks of (a) and of (a) coincide with the rank of A. 
Hence equivalent matrices have the same row rank. Conversely 
if (a) and (a) have the same row rank p, then both of these ma- 
trices are equivalent to the same matrix (18). Hence they are 
equivalent. 

Theorem 6. Two n x X n 2 matrices with elements in a division 
ring A are equivalent if and only if they have the same row rank. 

We consider now the conditions that a linear transformation A 
of 9?i into 9i 2 be an equivalence. Since A is a homomorphism, 
A is 1-1 if and only if its kernel 9J = 0. Clearly 9J is the null- 
space of A. Hence A is a 1-1 transformation of 9?i onto 9J 2 if 
and only if: (1) 9^ = 0; (2) dt x A = 9i 2 . In the special case in 
which 9ii = 9J 2 = 9? either of these conditions is sufficient; for if 
91 = 0, rank A = dim 9t = n. Hence dtA = 9f. On the other 
hand, if 9M = 9?, rank A = n and nullity A = 0. Hence 9f = 0. 

It should be noted here, too, that, if A is an equivalence, then 
its inverse A~ x is also an equivalence. The verification is left 
to the reader. The equivalences of a vector space onto itself 
constitute a group relative to the resultant operation. If A is a 
linear transformation in 9? and ( a ) is its matrix relative to the 
basis (e u e 2 , • • *, e n ), then A is an equivalence in 9? if and only 
if (a) is a unit. Hence we see that the group of equivalences in 
9? is isomorphic to the group L{ A, n) of non-singular matrices in 
A n . The former group is called the full linear group in the vector 
space 9?. 



EXERCISES 

1. Prove that, i { A z S(9ii, 9?2) and ©i and Ui are subspaces of 9fi, then 
(@i + UiM = ©i A + Ui A and (®i (1 Vh)J C f] UiA. 

2. Prove that, if (a) and (/3) are m X n matrices with elements in A, then (row) 
rank [(a) + (/?)] < rank (a) + rank (jS). 

3. Prove that, if (a) is an m X n matrix and (0) is an n X p matrix with ele- 
ments in A, then rank (a)(/3) < min (rank (a), rank (/3)). 
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4. If 



(a) 



■1 -2 

.2 0 lJ 



find units (jjl) and (y) such that (jx)(a)(v ) has the form (18). 

5. (Fitting). If is a linear transformation in 9?, let 3 he the set of vectors z 
such that zA m = 0 for some m and let © be the intersection of all the rank spaces 
9?^*, k = 1, 2, • • •. Show that 3 and © are subspaces and prove that 9J = 

30©- 

6 . Show that A maps the two spaces 3 and © of Ex. 5 into themselves; that 
vf is nilpotent in 3 and an equivalence in ©. Use this result to prove that any 
matrix in A n is similar to a matrix of the form 

m <n 

l 0 ( 7 )J 

where (J3) is nilpotent and where ( 7 ) is non-singular. 



7. Systems of linear equations. We consider the left-handed 
system of linear equations 



(19) 



£ i<211 + $2«21 + • * • + inOi n 1 = 61 
£ l <*12 + ^ 2«22 + • ' * + £ n <* n 2 = ^2 



£l^lm “f" ^2 a 2m * * * "i" %n&nm — S m , 

Here the a# and Si are given elements of the division ring A, and 
we seek solutions & = Pi in A. If & = Pi satisfies these equa- 
tions, then we say that the w-tuple (fi ly p 2y • • • , P n ) is a solution. 
If (Pu Pzy • • •, Pn) and (Piy p 2 f y • • •, Pn) are two solutions of 
(19), then ( 71 , 72 , • • •, y n ) where 7 i = P / — pi is a solution of 
the system of homogeneous equations obtained by taking the 
Si = 0 in (19). Conversely if (P u jS 2 , • • •, p n ) is a solution of (19) 
and ( 71 , 72 , • • *, y n ) is a solution of the homogeneous system, 
then (Piy p 2y • • *, /3 n ') where + 7 ,- is another solution of 

(19). This shows that in order to obtain the solutions, if any, of 
(19) we have to find a particular solution of this system and all 
solutions of the corresponding homogeneous system. We will 
then obtain all solutions of (19) by adding to the particular solu- 
tion all solutions of the homogeneous system. 

We therefore consider first the question of the existence of 
solutions for (19). We introduce the vectors Ui = i = 1, 
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2, •••) n 9 v = 25 j/j where (/i, / 2 , • • -,/ m ) is a basis for the m 
dimensional space ©. Then it is immediate that (/Si, p 2y • • •, /3 n ) 
is a solution of (19) if and only if the Pi satisfy 

( 20 ) V = P\U\ + P 2 U 2 + * * * + PnU n - 

Hence (20) is solvable if and only if v is linearly dependent on 
Ui , u 2y • • *, u n . On the other hand, v is linearly dependent on 
the Ui if and only if 

rank (« x , u 2y • • *, « n ) = rank (u u u 2y • • *, u n , 0 ), 

and this in turn holds if and only if the row rank of the matrix 
(a) is the same as that of the augmented matrix 

'an a 12 

a 2l OL 2 2 

( 21 ) 

Ol-nl Ot n 2 

-5i 5 2 

In particular, if A = <£ is commutative, then we have the fol- 
lowing 

Theorem 7. A system of linear equations (19) with aij and 5 y 
in a field <I> has a solution £*• = Pi in $ if and only if the matrix (a) 
of the coefficients and the augmented matrix (21) have the same de- 
terminantal ranks . 

We consider next the homogeneous system obtained by setting 
5 j = 0 . To study this system we introduce also an n dimensional 
vector space 9? with the basis (*i, e 2y • • •, e n ), and we let A be 
the linear transformation whose matrix is (a) relative to the bases 
( eiy e 2y - y e n )y (/i,/ 2 , * * *>/m)* Then in the above notation the 
vectors Ui = dA and (Pi y p 2y • • P n ) constitute a solution of the 
homogeneous system if and only if 2 PiUi = 0 . Since Ui = eiA , 
this amounts to the condition (2 Piel)A = 0. Thus (P 1 , p 2y • • 
/3 n ) is a solution if and only if 2/3^ is in the null space 91 of A. 
If is the nullity of A y we have a basis (z x , z 2 , • • •, z v ) for 91 and, 
if z k = 2 Pi (k) e iy k = 1 , 2 , • • *, Vy then 

05 1 ( - 1) , /3 2 (1) , • • •, /3n (1) ), • • •> 08i w , ^2 W , • • •> /3» w ) 
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is a set of (left) linearly independent solutions of the homogeneous 
system. Moreover, any solution (/Si, fi 2y • • •, fin) is a linear com- 
bination of these solutions. Because of the relation between 
rank and nullity, we know that v = n — p where p is the row 
rank of the matrix (a). We therefore have the following result: 

n 

Theorem 8. Let = 0, j = 1,2, • • - y m y be a left-handed 

i= 1 

system of homogeneous equations and let the row rank of {a) be p. 
Then there exist n — p linearly independent solutions (fif k \ fi 2 ( ‘ k \ 
• • *, fif k) ), k = 1, 2, • • *, w — p, such that any solution of the sys- 
tem is a left linear combination of these solutions . 

An immediate consequence of this theorem is the result, noted 
previously in an exercise (p. 13), that a system of m homo- 
geneous equations in more than m unknowns has a non-trivial 
solution. We remark also that in the commutative case we can 
drop the modifier “left” in the above statement and replace rank 
by determinantal rank. 



EXERCISE 

Find all the solutions of the following system 

2£i — £2 + £3 ~ 3£ 4 = 0 
£1 + £2 — £3 + 2£ 4 = 0 
4£i + £2 — £3 + £ 4 = 0, 

A, the field of rational numbers. 

8. Linear transformations in right vector spaces. If 9?/ and 

9t 2 ' are right vector spaces, a linear transformation of dtf into 
dt 2 ' is defined to be a mapping of 9V into 9J 2 ' such that 

(22) (*' + y')A = x’A + y'A y (x'a)A = (x’A)a 

for all x' y y' in 9?i' and all a in A. The discussion for left vector 
spaces can be carried over to the present situation with one or 
two notational changes. If (ef, e 2 ' y • • •, e n f) and (fi^ff, • • 
f n f) are bases in 9?/ and 9J 2 '> respectively, we write 

n 2 

el A = 23 //«>*> 

i=i 



(23) 



i 1, 2, • "j 81 
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and we call 
(24) 



"a li 


Oi 12 


«21 


a 2 2 


'^nil 


&ni2 



^lW2 

£*2712 

^niW2 



the matrix of the linear transformation A relative to the given bases . 
It should be noted that (24) is the transposed of the matrix of 
the coefficients on the right-hand side of (23). As before, to the 
sum of linear transformations corresponds the sum of matrices. 
The situation for the product is, however, different from that of left 
vector spaces. Suppose that (gf> g 2y • • g nz ') is a basis for $R 3 ' 
and let B be a linear transformation of $R 2 / into 9J 3 '. Let (0) be 
the matrix of B relative to the bases (f\ y ff y * * ',fn 2 ')j (gi> Z* > 
• • • » gn%)- Then 



Hence 

d’AB 



f/B = ZgkPkj, j = 1 , 2 , • • •, » 2 . 

: (Hfj'aji)B = 22 ( f/B)aji = 22 gkPkjaji 

\ j / j j,k 

: ^gk T ki 



where y ki = ^ Pkj&ji- Thus the matrix of C = AB is the prod- 

j 

uct (0)(a) and not (a)(0) as before. 

If S' = S'(3?', $R') denotes the ring of linear transformations of 
$R' into itself, then the correspondence A — >• (a) between the 
linear transformations and their matrices relative to a definite 
basis (ef, e 2 r y • • *, e n r ) is now an anti-automorphism. Thus it is 
1-1, and to the sum of linear transformations corresponds the 
sum of the matrices, and to the product of linear transformations 
corresponds the product of the corresponding matrices taken in 
reverse order. The ring S' is anti-isomorphic to the matrix ring 
A n and hence also to the ring S of linear transformations in an 
^-dimensional left vector space over A. 

A change of basis from (ef y e 2y • • •, e n f) to (uf, uf , • • *, # ni ') 
where = 'Ze/ixji and a change of basis from (f\ y ffi • • • ,fn 2 ) 
to (vf, v 2 , • • •, v n2 ') where vf = ^ffv qp is now reflected in a 
change of the matrix of A from (a) to 

(a) = M-VXm). 
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If a single basis is used in SR/ = SR 2 / = 3/> the matrix (a) of A 
is replaced on changing the basis by the similar one (ju) 1 («)(/*) 
where (ju) gives the change of basis. 

Our discussion of rank and nullity may be carried over with- 
out change. The rank of A now turns out to be the column rank 
of its matrices. This implies that equivalent matrices have the 
same column rank as well as the same row rank. Now we have 
seen that any matrix (a) is equivalent to one in the “normal” 
form (18). Also it is immediate from the definition that a matrix 
in normal form has the same row rank and column rank. It fol- 
lows that the arbitrary matrix (a) has the same row rank and 
column rank. We state this result as 

Theorem 9. The row rank and the column rank of any matrix 
are the same . 

A second, and somewhat more geometric, proof of this result 
will be given in § 12. We note finally that the theory of right- 
handed systems of linear equations can be developed in a man- 
ner completely analogous to that of left-handed systems con- 
sidered above. We have only to replace left vector spaces by 
right vector spaces. 

EXERCISE 

1. State and prove the analogue of Theorem 7 for right-handed systems. 

9. Linear functions. We have defined a linear function on a 
vector space 9? to be a mapping x — > f{x) of into A such that 

f(x +y) =f(x) +f(j)> f(ax) = af(x). 

If we define the sum of two such mappings in the usual way by 

(/ + £)(*) =/(*) +g(*) 

and the product fix for /x in A by 

(//*)(*) = /(*)/*, 

then we obtain a right vector space over A. This can be 
verified directly. However, it may be more illuminating to in- 
tegrate this result into the general theory of linear transformations 
developed in § 2. 
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For this purpose we recall that a linear function is just a linear 
transformation of dt into the one-dimensional vector space A. 
Now the right multiplication Mr* £ — > £m is a linear transforma- 
tion in the vector space A, since 

(£ + = (£ + rj)n = £ M + W = &r + Wr 

(o£)Mr = (<*£)/* = <*(£m) = a(fMr). 

Also we know that the set of linear transformations of 9i into A 
is a commutative group under addition and that the resultant of 
a linear transformation of 9J into A and a linear transformation 
of A into itself is a linear transformation of 9 1 into A. Hence if 
/ and g e S(9i, A), then f + g and f\x r e 2(9f, A). Since the defini- 
tion of /m given above is simply the resultant j\x ry 

(/ + S )M = (/ + g)Vr = fVr + gVr = /m + gV 

/<A+ *0 =/(M + *0r =/(Mr + J'r) = //*r + >r = /m +> 

/(M^) =fWr =A^rVr) = = (/m> 

/I =/U=/. 

This establishes the assertion made above that the set 9J* of 
linear functions is a right vector space over A. We call this 
space the conjugate space of the vector space 9?. 

Now suppose 9i is finite dimensional with the basis (e iy e 2y 
• • •, e n ). A linear function / is completely determined by the 
values f(ej) = a;. Moreover, for arbitrary oli e A there exists a 
linear function such that f(ej) = If x = then f(x) = 

We can also state these simple results in terms of matrices 
as follows. If/ is a linear function and f(ej) = a iy then we can 
use the basis 1 in A and write f(ej) = Then we see that the 
matrix of / relative to (e iy e 2 , • • e n ) y (1) is 

«2 






( 25 ) 
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We obtain in this way a 1-1 correspondence between 9?* and the 
set of n X 1 matrices with elements in A. Sums correspond 
under this correspondence. Also it is immediate that, if / has 
the matrix (25), then fix has the matrix 

'oti ix 
a 2 ix 

- OLnlX . 

From this it is clear that 9t* is equivalent to the right vector 
space of ^-tuples (now written as columns). Evidently this im- 
plies that 9i* is n dimensional over A. 

The last result can be made somewhat more transparent in 
the following way. If (e u e 2y • • *, e n ) is a basis for 9? over A, we 
define a set (<?i*, e 2 * y • • *, e n *) of linear functions by the equations 

(26) €% (€j) Sijy iyj 1^ 2, * y My 

where 5 t *y = 0 if i ^ j and = 1 if / = j. Then the value of the 
linear function 'hef'a.i for ej is 

^ ^ ojy(^y) 'y ^ &i (^y)o^y J 1, 2, * , n 

i i 

and this implies that the e* form a basis for 9?* over A. Thus if 
/ is any linear function and f{e 3 ) = ay, then / and 2^*a* have the 
same values at the ej. Hence / = 2^*a*. Also the d* are lin- 
early independent; for if 2 e*a.i = 0, then the displayed equa- 
tions show that every ay = 0. We shall call the basis (*i*, e 2 *, 

• • *, e n *) the complementary basis to (e u e 2y • ••, e n ). Such bases 
will play an important role in the sequel. 

EXERCISE 

1. Prove that the right multiplications are the only linear transformations in 
the vector space A. 

10. Duality between a finite dimensional space and its con- 
jugate space. Let * be a fixed vector in the space 9? and let / 
range over the conjugate space 9f* of 9f. Then the mapping 
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/ /(#) appears to be a mapping of 9i* into A. To emphasize 

that we are now dealing with a function of / e 9t* we write x(f) 
for /(#)• We assert that x(f) is linear; for 

*(/ + .?) = (/ + g)(x) = Ax) + g(x) = x(J) + x(g) 

*(/«) = (/<*)(*) = /(*)« = *(/)«. 

Thus we see that every vector x e 9{ determines an element x(f) 
of the (left) vector space 9?** of linear functions defined on 9J*. 

We consider next the properties of the mapping x —> x(f) of 
9 1 into 9i**. We note first that this mapping is a linear transfor- 
mation of 9 1 into 9£**; for 

(x + y)(f) =f(x +y) = f(x) + f(y) = x(f) + y(f) 

and this means that the function associated with x + y is the 
sum of the function x(f) andjy(/). Also 

(ax)(f) = /{ax) = af{x) = ax(f), 

so that the linear function associated with ax is a times the linear 
function associated with x. We prove next that x — > x(f) is 
1-1. For this it suffices to show that, if x(f) = 0 for all f y then 
x = 0. Now this is clear; for if x ^ 0, we can take x as the first 
vector ei of a basis (e u e 2y • • *, e n ). We have seen that there 
exists a linear function / such that /(ef) = ai for any given 
In particular, we can find an / such that f(x) = f(e i) 0. 

We have now established that the mapping x — » x(f) is a 1-1 
linear transformation of 9? into 9?**. Hence if © denotes the 
image space, dim © = dim 9 1 = n. On the other hand, we know 
that dim 9t = dim 9t* = dim 9?**. Hence dim © = dim 9t**. 
This, of course, implies that © = 9?**. We are now led to the 
striking conclusion that every linear function on 9?* can be ob- 
tained as an x(f) for some vector x e 9J. This is the important 
principle of duality that 9i can be identified with the conjugate 
space of (the space of linear functions on) 9 1*. As a first applica- 
tion of this result we prove the following 

Theorem 10. If (^i*, e 2 *> • • •> e n *) is a basis for 9?*, then there 
exists a basis {e u • • •, e n ) for 9t such that ef{e 3 ) = 5^. 
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Proof. We know that we can find in 9 ?** a basis Oi**, e 2 **, 
•••, e n **) that is complementary to the given basis Oi*, e 2 * y 

• • •, e n *) y that is, £;**(/,•*) = But there exist vectors e iy e 2y 

• • • , e n in 9J such that *y*(*<) = *,•**(*,•*), /, j = 1, 2, •••,». 
These vectors satisfy the conditions of the theorem. 

We shall establish next a reciprocal relationship between the 
subspaces of 9J and the subspaces of its conjugate space 9 t*. If 
© is a subspace of 9 ?, we let^X©) be the totality of vectors £ 89 ?* 
such that£(jy) = 0 for ally e ©. The set ./’(©) is clearly a subspace 
of 9t*, and we shall call it the subspace of 9i* incident with ©. 
Similarly if ©* is a subspace of 9 t*, we obtain the subspace ^(©*) 
incident with ©*. This consists of the vectors y such that g(y) 
= 0 for all g 8 @*. Evidently if ©1 3 © 2 , then y(@i) £ ^(©2), 
that is, the correspondence © — > 7*(©) is an order reversing cor- 
respondence of the lattice L(fR) of subspaces of 9 ? into the lattice 
L(9t*) of subspaces of 9 ?*. It is clear also that ./OX©)) 3 ©. 
We prove next the following 

Lemma. For any © c 9 t, y(©) = w — /or any 

©* c 91 *, dim j {<&>*) = n — ©*. 

Proof. Because of the duality between 9 t and SR*, it suffices to 
prove one of these statements. We choose the first. Let © be a 
subspace of 9? and let (u 1 , u 2y • • *, u n ) be a basis for 9 1 such that 
(uiy u 2y • ••, « r ) is a basis for ©. Let (#1*, # 2 *, * * ««*) be the 

complementary basis for 9t*. Suppose now that£ = e y(©). 

Then (2« f -*j8 t -)(«y) = 0 for 7 = 1 , 2, • • •, r. But (S« t -*j8»)(«y) = ( 3 j . 

n n 

Hence £ = 23 Conversely, any linear form y = 23 

r+1 r + 1 

satisfies = 0 for j = 1, 2, • • •, r. Hence also ^(jy) = 0 for 
every y e ©. Thus yX©) = [# r+ i*, & r +2*> •••, «»*]• Hence 
dim © = r while dimy(©) = n — r. 

An immediate consequence of this lemma is that ./OX©)) = © 
and j OX©*)) = ©* for any © and ©*. Thus we have seen that 
i 0 (©)) 3 ©. Moreover, dim ./OX©)) = n — (n — r) if dim © = 
r. Hence dim j OX®)) = dim ©, and this implies that © = 

We now see that the mapping @ — » _/(©) is a 1-1 mapping of 
L(9t ) onto £(«*). If Mi) = M*)> then ©i = KMi)) = 
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i0*(©2)) = © 2 - Moreover, if ©* is any element of L(9?*), then 
©* = /(©) where © = 7(©*). 



EXERCISES 

1. Prove that, if © — ■» © is a 1-1 order reversing correspondence of one lattice 
onto a second one, then ©i + ©2 = @1 H ©2 and ©1 fl ©2 = ©1 + © 2 . 

2. Let / be a linear function on 9? and let / denote its contraction on the sub- 
space © of 9?, that is, / is the mapping y — > f(j) for y in ©. Show that the 
mapping / — > / is a linear transformation of 9t* into the space ©* of linear func- 
tions on ©. Show that .;(©) is the null-space of this transformation. Prove 
that / — > / is a mapping of 9?* onto ©*. 

3. Prove that, if ei*, e<i *, • • •, e r * are linearly independent linear functions 
then there exist vectors <?i, ^ 2 , • • *, e r such that efiej) = 8a, i,j = 1, 2, • • •, r. 

11. Transpose of a linear transformation. Let A be a linear 
transformation of 9?i into 9?2 and let / be a linear function de- 
fined on 9?2* Then the mapping x x — > f(xiA) is a linear function 
defined on 9ti; for this mapping is simply the resultant Af of 
A and/, that is, 

(27) (Af)(x 1) = f(x 1 A). 

We now let / vary over the conjugate space 9?2* of 9?2- Then we 
obtain a mapping/ — » Aj of 9t2* into 9h*. We assert that it is 
linear. By the distributive law 

J(f + g) = ^f+Ag 

and by the associative law 

A(M = A(fn r ) = (Af)n r = (47)m 

and these are the defining conditions. We shall call the linear 
transformation / — > Af of 9J 2 * into 9?i* transpose of A and we 
denote it as A*. Hence fA* = Af where on the right Af is the 
resultant of A and /. 

We consider next the properties of the mapping A — > A*. 
First, if 5 is a second member of 8(9ii, 9?2)> then A + B e 8(9Ji, 9J 2 ), 

f(A + B)* = (A + B)f =Af+ Bf=fA* +fB* = f(A* + 5*). 

Hence 

(28) (A + B)* = A* + B*. 
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Now let 9t 3 be a third vector space and let C be a linear transfor- 
mation of into 9i 3 . Then AC e?(91i, 91 3 ). Hence if h e 91 3 *, 
h{AC)* = (AC)he9t 1 *. Moreover, 

h(AC)* = (AC)h = A(Ch) = A(hC*) = (hC*)A* = h(C*A*) 

and this proves the rule 

(29) (AC)* = C*A*. 

Our results also apply, of course, to right vector spaces. In 
particular, if A* is a linear transformation of the right vector 
space of linear functions 9 1 2 * into the right vector space 9?i*, 
then we can associate with it a transpose transformation of 9fi** 
into 9 ? 2 **j where 9?,** is the left vector space of linear functions 
on 9i t *. On the other hand, we have the natural equivalences of 
9ti** onto 91,- and these enable us to define a linear transformation 
of 9? 1 into 9?2 corresponding to the transpose of A*. We shall call 
this transformation the transpose A** of A* in 9?i to 9?2- We 
now determine the transformation A** for the given linear trans- 
formation A* of 91 2 * into 91i*. Let / vary over 91 2 * and let Xi 
be a definite vector in 9ii. Then the mapping 

f^ Xl (/A*)^(fA*)( Xl )e A 

is a linear function on 9i 2 *. Hence there is a uniquely determined 
vector #2 e 9i2 such that 

(30) (M*)( Xl ) = *i (fA*) = x 2 (J) =/Cv 2 ) 

holds for all / e 9?2*- We now have a mapping A** sending Xl 
into # 2 . The argument given above shows that this mapping is 
linear. This can also be verified directly. 

We shall now show that the two correspondences A — >- A* 
and A* — » A** are inverses of each other, that is, if A* is the 
transpose of A e ?(9ii, 9t 2 ), then the transpose of A* in 9?i to 9t 2 is 
A and, if A** is the transpose in 91i to 9?2 of A* e 8(9?2*> 3fi*) } then 
the transpose of A** is A*. To see the first of these statements 
we note that (j A*)( Xl ) = f( Xl A**) by the definition of A**. 
Hence j(x\A) = f( Xx A**) and j(x\A — Xl A**) = 0 holds for all 
/. As we have seen (p. 54) this implies that Xi A = Xx A** for 




58 



LINEAR TRANSFORMATIONS 



all Xi so that A = A**. The second statement is proved by 
using the relation 

(fA*)( Xl ) = /( Xl A**) = {fA***){ Xl ). 

Since this holds for all X \ ,fA* = fA*** and A* = A***. It is 
now immediate that the mapping A — » A* is 1-1 of 8(91i, 3^2) 
onto 8(812*, 9ti*). 

In the particular case of 8 = 8(91, 91) and 8* = 8(91*, 91*) the 
mapping A — » A* is an anti-isomorphism between these two 
rings. This is clear from the 1-1-ness and equations (28) and 
(29) - 

12. Matrices of the transpose. Again let A be a linear transfor- 
mation of 91 1 into 91 2 and let (<?j, e 2 , • • •, e ni ), (/i,/ 2 , • • - ,/„,) be 
bases in these spaces. We choose complementary bases (ei*, e 2 *, 
•••,<?„,*), (/i*,/2*, • • - ,/n*) in the spaces 91 1*, 91 2 *. Hence 

e*(e } ) = Sij, f P *(f q ) = 5 pq . 

Suppose now that (a) and (/3), respectively, are the matrices of 
A and of its transpose relative to these bases. Then 

eui = X (Xipfp> fg*A* = 2 ej*Pj g . 

P 

Sine e/ q *(eiA) = f q *A*{e % ), 

f Q* (^2 a ipfp \ = X (fi~) ■ 

\ P / j 

Hence a iq = f3 iq and (a) = (j8). In other words* if complementary 
bases are used in 9t** and in $R*** i = 1* 2* then the matrices of A 
and of its transpose are equal . 

This result is perhaps a little unexpected. Its explanation lies 
in our definitions of matrices of linear transformations in left 
vector spaces and in right vector spaces. In the special case in 
which A = <£ is a field the result usually appears in a somewhat 
different form. Here it is customary to consider all spaces as 
left vector spaces* as can be done by writing ax for xa. Then 
the relations f q *A* = 22 that we have just derived may 
be written as 3 

f q *A* = 22 e j* a jq = 22 a jq e j* = 22 a qj e o* 
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where a q / = aj q . Thus the matrix of A* considering the 0?»* as 
left vector spaces is the transposed matrix (a)' of the matrix (a) 
of A. If 0? x = 0? 2 = 0? we have the isomorphisms A — > ( a ) and 
A* — > (a)' of the rings of linear transformations 8 = 8(0?, 0?) and 
8(0?*, 0?*) respectively onto the matrix ring 3> n . Since yf —» A* 
is an anti-isomorphism we see that the mapping (a) — > (a)' is an 
anti-isomorphism in <£ n , that is, (a) — » (a)' is 1-1 and 

[w + os)]' = w' + os)', [(«)(«]' = m«y. 

This, of course, can also be verified directly. 

We return now to the general case of an arbitrary division ring, 
and we shall establish the following relation between rank spaces 
and null spaces of a linear transformation and its transpose. 

Theorem 11. The null space of A is the subspace incident with 
the rank space of the transpose A * of A. The rank space of A is the 
subspace incident with the null space of A *. 

Proof. It is sufficient to prove the first of these statements. 
Hence let z be a vector such that zA = 0. Then fizA) = 0 and 
(fA*)(z) = 0 for all /. Hence z e j(0? 2 *^*). The converse fol- 
lows by retracing the steps. 

We can now prove 

Theorem 12. Rank A = Rank A *. 

Proof. Rank A * = dim (0? 2 *^*) = dim j(yi) where 0? is the null 
space of A. Also dimy(9?) = n\ — dim 9? where ni = dim 0?i = 
dim 0?i*. On the other hand, rank A = n\ — dim 9?. Hence 
rank A = rank A *. 

If ( a ) is any matrix, (a) can serve as the matrix of a linear 
transformation A and also of the transpose A* of A . Then, as we 
have seen, rank A is the row rank of ( a ) while rank A * is the 
column rank of (a). Theorem 12 therefore gives a geometric 
proof of the theorem that the two ranks of any matrix are equal. 

13. Projections. We conclude this chapter by considering a 
type of linear transformation of a vector space that is intimately 
connected with direct decompositions of the space into subspaces. 
Suppose that 0? = 0?i ® 0? 2 © * • • © 0?r> that is, 0? is a direct sum 
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of the subspaces 9 in the sense of § 11, Chapter 1. Then if x is 
any vector in 9?, x may be written as 



( 31 ) x = Xi T - %2 T - * * * T" %r 

where Xi s dti. We know also that for a given / the component Xi 
of x in is unique. Hence the mapping Ei that sends x into Xi 
is single-valued. We wish to investigate the properties of these 
mappings. Let y = yi + jy 2 H h y ri y% e then 

x + y = (xi + y i) + (*2 + J 2 ) + * * • + (x r + y r ) 

where + jy* e 9f*. Thus the component in 9t» of x + y is + 
Hence we have the following condition on Ei\ 

(x + y)Ei = xEi + yEi. 

Also by (31) 

OLX = OiX 1 T" OLX 2 T" * * * T" OiX r , 



Hence the component in 9t» of ax is aX{. In other words, we have 
the relation 

(ax)Ei = a(xEi). 

We therefore see that Ei is a linear transformation in 91 over A. 

We consider next the relations that hold among the E *. We 
note first that (31) may be rewritten as 

(32) x = xE\ T“ XE 2 ~b * * * 4” xE r . 

Hence 

(33) Ei + E 2 + • • • + E r = 1. 

If Xi e dliy the representation (31) of x = Xi reads Xi = x^ Thus 
XiEi = x iy XiEj = 0 i f /. Now for any x , xEi = Xi e 9?*- 
Hence we see that xEi 2 — XiEi = Xi = xE{ and xE{Ej = XiEj = 
0. We therefore have the following equations. 

(34) = E iy EiEj = 0 if i 7 ^ j. 

Now, in general, we shall call a linear transformation E that 
is idempotent in the sense that it is equal to its square ( E 2 = E) 
a projection. A set of projections will be called orthogonal if the 
product of any two distinct ones in the set is 0. Finally we shall 
say that the set of orthogonal projections E ly E 2y • • •, E r is sup- 
plementary if their sum is the identity mapping. Then the re- 
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suits that we have just obtained concerning the mappings Ei de- 
termined by the decomposition 9J = 9?i ® © • • • © 9? r are that 

these mappings form a supplementary set of orthogonal projec- 
tions. We shall now show that every finite supplementary set of 
orthogonal projections is obtained in this way. Thus let E u E 2 , 

• • • > E r be such a set. Let 9L = 8i Ei, the rank space of £»•. 
Then if x is any vector in 9f 

x = xl = x(2Ei) = xE x + xE 2 H + xE r 

e -f- ?HE 2 * * * H - 9?-E r * 

Hence 9t = 3?i + 9t 2 + • • • + $t r - Next let 

(35) Zi = Zi + • • • + Z{_ i + Z{+i + • • • + £ r 

where Zj e 9?y. Since Zi has the form yiE iy ZiE{ = yiE? — y { Ei = 
Zi. Similarly if j ^ /, ZjEi = y^EjEi = 0. Hence if we operate 
on the left-hand side of (35) with E iy we obtain z { and, if we oper- 
ate on the right-hand side, we obtain 0. Hence Zi = 0. This 

proves that 9 D (9t x H h 9^*_i + dt i+x H h 9J r ) =0 for 

i = 1, 2, •••,/-. Thus 9 1 = 9?i ® 9? 2 © • • •© 9 ? r - Since # = XxEi 
and xEi = Xi e 9 ?i, the projections determined by this decomposi- 
tion are the mappings x —> xE iy that is, the given linear transfor- 
mations E{. We have therefore established a 1-1 correspondence 
between direct decomposition of the vector space and finite sets 
of supplementary orthogonal projections. 

We obtain next canonical bases for the set of supplementary 
orthogonal projections E u E 2y • • •, E r . Let 9t = 9fi ® • • • ® 9i r 
as above. Then we can obtain a basis (/i,/ 2 , • • -,/ n ) for 9 1 such 
that (/i, • • - ,/ pl ) is a basis for (/ P1 +i, * * *,/ P i+ P2 ) is a basis for 
9?2, etc. Thus pi = rank E Since EjEi = 0 for j ^ /, the linear 
transformation Ei annihilates all the /’s outside the i- th subset 
• • -j/pi+.-.+pi). Since Ei 2 = E iy any vector in 9 U 
= 9 lEi is sent into itself by E *•. It follows that the matrix of 
Ei relative to the basis C/ 1 ,/ 2 , * * -,/n) is the diagonal matrix 

- — Pi . 

(36) (5,) = diag {0, •■•,<>, 1, - 1,0, *-.,0} 

in which all elements are 0 except those in p x + • • • + pi_i + 1st 
to pi + • • • + pith rows. 
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We now drop the assumption that = 1. Thus we suppose 
that E\ y E 2y • • *, E r is any finite set of orthogonal projections. 
Set E = and E r+ 1 = 1 — E. Since EiEj = 0 for i ^ j, <r , 
E 2 = XE 2 = XE{ = E. Hence also E r+ i 2 = (1 — E) 2 = 1 — 
2E + E 2 = \ — 2E E = l — E = E r+ i. Moreover, if j < r, 
EjE = Ej'ZEi = E 2 = Ej and similarly EEj = This implies 
EjE r+1 = Ej{ 1 - E) = Ej - EjE = - Ej = 0 and 

= 0. We have therefore shown that the set E iy E 2y • • £ r +i is 
a supplementary orthogonal set of projections. Our discussion 
shows that there exists a basis (fufzy * * *,/n) of 9J such that the 
matrix of Ei relative to this basis has the form (36), i = 1, 2, 

■*> r - 

These remarks apply in particular to a single idempotent lin- 
ear transformation. If E x is of this type, then E x and E 2 = 1 
— Ei are orthogonal and supplementary. Relative to a suitable 
basis the matrix of Ei has the form diag {1, 1, • • •, 1, 0, • • •, 0}. 
The number of Ts is the rank of E\. 

EXERCISES 

1. Prove the following theorem on matrices: If (ei), (€ 2 ), • • *, (e r ) is a set of 
matrices such that 

(37) fc) 2 = fe), (e;)(e/) = 0 if i 7 * j> 

then there exists a matrix (jit) in £( A, n) such that has the form (36). 

2. Prove that two projections £ and F have the same rank spaces if and only 
if EF = E and FE = F. Prove that they have the same null spaces if and only 
if EF = £, FE = E. 

3. Show that, if Ei, £ 2 , • • •, Ek are projections with the same rank spaces and 
a 1 , « 2 , • • *, oik are elements of A such that 2a; = 1, then 2 a;£; is a projection 
with the same rank space as the Ei. 

4. Assume that the characteristic of A is not 2. (This means that, if a 5 ^ 0, 
then 2a = a + a 9 ^ 0 in A.) Prove that, if £1 and £2 are projections whose sum 
is a projection, then E\ and E 2 are orthogonal. 

5. Show that, if E\ and £2 are commutative projections, then £i £2 and 
£1 + £2 — E\E 2 are projections. 
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This chapter is devoted to the study of a single linear transfor- 
mation in a vector space over a field. We shall obtain a decompo- 
sition of the vector spaces into so-called cyclic subspaces relative 
to a given linear transformation. By choosing appropriate bases 
in these spaces we obtain certain canonical matrices for the trans- 
formation. These results yield necessary and sufficient condi- 
tions for similarity of matrices. Following Krull we shall derive 
the fundamental decomposition theorems by making use of the 
theory of finitely generated o-modules, o a principal ideal domain. 
We shall also prove in this chapter the Hamilton-Cayley-Fro- 
benius theorems on the characteristic and minimum polynomials 
of a matrix. Finally we study the algebra of linear transforma- 
tions that commute with a given transformation. 

1. The minimum polynomial of a linear transformation. 
Throughout this chapter we assume that the underlying division 
ring is a field. We use the notation <£ for this field. Let SR be a 
vector space over <£ and let 8 = 8 (SR, 9i) be the set of linear trans- 
formations of 9? into itself. We recall that 8 can be regarded as 
an algebra over <£: that is, 1) 8 is a ring; 2) 8 is a vector space 
over <£ with addition the same as the ring addition; and 3) 

y{AB) = (y A)B = A{yB) 

holds for A, B in 8 and y in $. The product y A is by definition 
the resultant 7 iA = Aji. We recall also that 8 is w 2 dimensional 
over <£. 
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Now let A be an arbitrary linear transformation in 9? and let 
[1, A , A 2 , • • •] be the subspace of 2 spanned by the powers of A. 
If A is not a multiple of 1, the pair (1, A) is a linearly independent 
set. Again if A 2 is not linearly dependent on (1, A ), then, since 
(1, A) is an independent set, so also is (1, A , A 2 ). Continuing 
in this way we obtain the linearly independent sets (1), (1, A) y 
(l y A y A 2 ) y (l y A y A 2 y A*) y .... 

Since 2 is finite dimensional, this process must break off after 
a finite number of steps. This, of course, occurs when we reach 
the first power A m of A which is linearly dependent on preceding 
powers. Then (1, A y A 2 y •••, A m ~ l ) is linearly independent, 
but A m e [1, A y A 2 y * • •, A m ~ 1 ]. Hence we have a relation of the 
form 

(1) A m = /x 0 l + Mi A + • • • + fx m _iA m ~ 1 

where the Mi e$. Multiplication of (1) by A yields, in virtue of 
the distributive law and the algebra condition 3) above, the 
equation 

A m + 1 = ixqA + mi A 2 + * * * + n m —\A m . 

Since A m e [1, A y A 2 y • • •, A m ~ x ] y this shows that A m+1 e [1, A y 
A 2 y * • *, A m ~ x ]. Similarly we can prove that A mJr2 y A mJrZ y • • • 
are in [1, A y A 2 y • • •, A m ~ x \. Hence 

[l y A y A 2 y ...] = [1 >A y A 2 y ••• y A'*~ 1 ], 

and since the displayed set of A k is a linearly independent one, 
we see that [1, A y A 2 y * * •] has the basis (1, A y A 2 y • • •, A m '~ v ). 

The relation (1) has an important interpretation which we shall 
now examine. We note first that the mapping 

( 2 ) a — » al = ai 

is an isomorphism of $ into 2. Thus 

(a + 0)1 = al + 01 

( 3 ) 

(a/3)l = M)(l 2 ) = a(Kl 2 )) = «(1(181)) = (al)(j81) 

and al ^ 0 if a 0 so that the mapping is 1-1. It follows that 
the image set 3>1 is a subfield of 2 isomorphic to <£. 
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Now let <i>[\] denote the integral domain of polynomials in the 
transcendental element (indeterminate) X with coefficients in <£. 
The linear transformation A commutes with every element 
a\ (= ai). It follows from this that the mapping 

(4) /(X) = a 0 + aiX + a 2 X 2 + • • • — > /(A) 

= aol 4“ oc\A + a 2 A 2 

is a homomorphism of $[X] into 2. This homomorphism is an ex- 
tension of the isomorphism (2), and it is characterized by the 
property that it maps X into A. Let $ be the kernel of our homo- 
morphism, that is, $ is the ideal of polynomials v(\) such that 
v(A) = 0. Now it is known that$[X] is a principal ideal domain.* 
By this we mean that every ideal in <£[X] consists of the multiples 
of a particular polynomial. In particular $ = (/x(X)) is the set 
of multiples of /z(X). The generator ju(X) is uniquely determined 
if we specify that its leading coefficient is 1. By the relation (1) 

A m - Hm-iA™- 1 - Mol = 0. 

Hence we see that the polynomial 

(5) X- - Mm— iX w— 1 - Mm — 2 X m 2 Mo 

which is 7^ 0 is in $. The result that we established before there- 
fore shows that $ 9 ^ 0. We now note that the polynomial (5) 
as determined by (1) is the generator n (X) of for otherwise 
there exists a polynomial of degree lower than m such that 
A r 4- v r -.\A r ~ l 4 b ^o = 0, and this contradicts the linear in- 

dependence of 1, A y • • •, A r . 

We shall call the polynomial /z(X) the minimum polynomial of 
the linear transformation A . It is characterized by the following 
properties: 1) leading coefficient = 1; 2) y, (A) = 0; 3) if p(X) is 
any polynomial such that v(A) = 0, then v(K) is a multiple of 
m(X). 

The isomorphism between 2 and the matrix ring <I> n enables 
us to carry over to matrices the results that we have just de- 
rived. Let (a) be the matrix of A relative to e 2 , • • •, e n )- 
Then we know that the matrix of jS 0 l 4- $\A 4- • • • 4- PkA k rela- 



See, for instance, Volume I, p. 100, of these Lectures. 
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tive to this basis is 0 O 1 + j8i(a) + • • • + It follows that, 

if the polynomial ju(X) given by (5) is the minimum polynomial 
of A y then 

/*((«)) = ( a ) m — Mm-iW" 1 " 1 Mol =0. 

Conversely, a relation in (a) implies a corresponding relation for 
A . This implies that ju(X) is the polynomial of least degree that 
has (a) as a root. For this reason we shall also call ]u(X) the mini- 
mum polynomial of the matrix (a). 

EXERCISES 

1. Prove the rule (aP)(xy) = (pix)(0y) for any algebra. 

2. What is the minimum polynomial of an idempotent linear transformation 

5 * 0 ? 

3. Let 9? be the vector space of polynomials of degree < n — 1 with real co- 
efficients. Find the minimum polynomials of the differentiation operator D and 
of the linear transformation A such that JQl)A = /(X + 1). 

2. Cyclic subspaces. Of special interest in the study of a 
linear transformation A is the discovery of subspaces © that are 
invariant relative to A in the sense that they are mapped into 
themselves by A . If © is such a subspace, then A induces a 
mapping in © and it is obvious that this mapping is a linear trans- 
formation. 

The simplest type of invariant subspace is an invariant sub- 
space that is generated by a single vector. Let u be a particular 
vector in 9J and let © denote the smallest invariant subspace 
containing u. Since we©, so is uA\ hence also uA 2 = ( uA)A y 
uA z = ( uA 2 )A , • • • . Thus 

[Uy UAy UA 2 y * * *] C (g. 

On the other hand, if v is any vector in the space [w, uA y • • •], 
then 

v = a 0 u + cl\uA + • • • + ol t uA t 

= uf{A) 

where /(X) = a 0 + ot x X H — • + a r \ r . Hence vA = uf{A)A e [w, 
uA y • • •]. Thus this space is invariant, and [w, uA y uA 2 y • • •] is 
therefore the smallest invariant subspace of 9? containing the vec- 
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tor u. We now denote [u y uA y • • •] by {#}, and we call this space 
the cyclic space (relative to A ) generated by the vector u. 

Suppose now that u 9 ^ 0, and let uA r y r > 1, be the first vec- 
tor in the sequence u y uA y uA 2 y • • • that is linearly dependent on 
the preceding. Then, as in the above discussion of the powers of 
A y we can conclude that 

(6) {u} = [u y uAy • • •, uA r ~ 1 ] 

and that (u y uA y • • *, uA r ~~ 1 ) is a basis for this space. Also we 
have a relation of the form 

uA r = v 0 u + vi uA + • • • + iv_i uA r ~ l 

or, what is the same thing, u/i u (A) = 0 where 

(7) /x„(X) = X r — j' r _iX r_1 v 0 . 

From the definition of r it is clear that /x tt (X) is a polynomial ^ 0 
of least degree having the property ujjl u (A) = 0. We shall call 
Mu(X) the order of the vector u. 

Consider now the totality of polynomials v(\) such that 
uv(A) = 0. It is readily seen that 3 u is an ideal in $>[X]. The 
polynomial of positive degree is in $ u and is a non-zero 

polynomial of least degree in this ideal. It follows that 3u = 
(Mu(X)). We observe finally that Mu(X) is the minimum polyno- 
mial /x(X) of the linear transformation induced by A in { u }. For 
if v is any vector in {&}, v = uf(A ) so that 

vn u (A) = u/(A)fi u (A) = un u (A)f(A) = Of (A) = 0. 

Hence ix u (A) = 0 in {u} and therefore /l(X) | Mu(X). On the other 
hand, uji(A) = 0. Hence /Z(X) e 3 W and so Mu(X) | m(X). These 
two relations imply that /Z(X) = m u (X). 

If u — 0, {u} = 0. In this case $ u = (1)> and we shall say 
that the order is 1. 

If j u(X) is the minimum polynomial of A y then fi(A) = 0 and 
Ufx(A) = 0. It follows that the minimum polynomial of A is a 
multiple of every order /z M (X). 

3. Existence of a vector whose order is the minimum poly- 
nomial. Let eiy e 2y • • •, e r be a set of vectors that generate 9t 
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relative to A in the sense that every vector x can be written in 

r 

the form x = for suitable <£;(X) in <$[X]. Such finite 

1 

sets exist; for any ordinary basis (e u e 2 , * • *, e n ) has this property. 
Let m(X) be the least common multiple Lu ei (X), Me 2 (X), • • *, /z*(X)] 
of the orders /^(X). Since each n ei (\) is a factor of the minimum 
polynomial /z(X) of A, /Z(X) | /x(X). On the other hand, if x = 
Xei(t>i(A) y then 

xfi(A) = 'Le i <j>i(A)pL(A) = 'Le i p.(A)<l>i(A) = 0. 

Hence /2(yf) = 0 and this implies that /z(X) | /z(X). Hence /x(X) = 
m(X). Thus the minimum polynomial of yf is the least common 
multiple of the orders of any set of generators. This generalizes 
the result established above in the cyclic case. 

We now write the jz e .(X) in terms of the same primes, say 

Me,V) = 7T 1 (X) il V 2 (X) i2 *' • • • 7T S (X)* S *' 

where the As are distinct and kji > 0. We may also suppose that 
the leading coefficients of the As are all 1. Then if kj — max (kj U 
kj 2 y • • •, kjr), the polynomial 

7 ri(X)Sr 2 (X)* 2 • • • tt s (X)*' = /Z(X) = M (X). 

Now, in general, if the order /z x (X) = mi(X)m 2 (X), then the order 
of y = xijli(A) is fx 2 (X). The proof is immediate. Applying this 
fact we see that, if ki = k li0 then the order of 

h = • • • t r s (A) k ^ 

is 7ri(X) il . Similarly, we can find a vector f J} j = 2, • • •, s, of 
order Tj(\) k \ We shall now show that the vector 

/ =/i +/2 h/ s 

has order /x/(X) = /x(X). This will follow from the following more 
general result: 

Lemma. If the orders m»(X) of f^i = 1,2, • • •, j, relatively 
prime in pairs , of f — fi + /2 + * * • + f s is the prod- 

uct p(X) = II m;(X). 
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Proof. Since fipfA) = 0 for each /, fv(A) = 'Efiv(A) — 0. 
Hence m/(X) | KX). We note next that if pi(X) = m/(X)/x 2 (X)/x 3 (X) 

* * * Ms(X), then fpi(A) = 0 and fjpfA) = 0 for j = 2, 3, • • •, j. 
Hence, also f x pi(A) = 0. This implies that mi(X) | pi(X). Since 
Mi(X) is prime to M 2 OO 3 * * *, m«(X), it follows that Mi(X) | p/(X). 
Similarly p 2 (X) | m/(X), p 3 (X) | m/(X), • •. Hence v(\) = IIm;(X) is 
a factor of m/(X). Thus m/(X) = v(\). This completes the proof. 

The above discussion now gives the following 

Theorem 1 . There exists a vector in whose order is the mini- 
mum polynomial of A . 

We know that the degree of the order of any vector u is the 
dimensionality of the space { u }. Hence deg p w (X) < n . Thus 
we have the 

Corollary. The degree of the minimum polynomial of A is < n. 

EXERCISE 

1. Use the method of this section to prove that, if m is the maximum order 
of the elements of a finite commutative group G, then x m = 1 for all x in G. 

4. Cyclic linear transformations. The space is called cyclic 
(relative to A) and A is called cyclic (sometimes non-derogatory ) 
if there exists a single vector e that generates % that is, = {e}. 
We know that in this case p(X) = p e (X), and this shows that the 
orders of any two generators of a cyclic space are the same. It 
shows also that the minimum polynomial of a cyclic linear trans- 
formation has degree n; for if 9t = { e }, deg p e (X) = dim 9 1 = n. 
Hence deg p(X) = n. 

Conversely, suppose A is any linear transformation whose min- 
imum polynomial has degree n. Then by Theorem 1 there exists 
a vector e such that /z e (X) = p(X). Then degp e (X) = n and so 
dim {e} = n. This, of course, means that {e} = 9 t. We there- 
fore have the following criterion. 

Theorem 2. A linear transformation is cyclic if and only if its 
minimum polynomial has degree n . 

Suppose now that 9 1 = {e} and let 

M(X) = Me(X) = X" - p n — lX n 1 MO. 
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We know that the vectors eA y • • eA 7l_1 ) constitute a basis. 

We wish to determine the matrix of A relative to this basis. 
Since 

eA = eA 
(eA)A = eA 2 
(eA n ~ 2 )A = eA 71 - 1 

(eA n ~ 1 )A = fi 0 e + vi(eA) + • • • + Mn-i (eA 71-1 ), 
the matrix we seek is 



0 


1 


0 • 


• • 0 - 


0 


0 


1 0 


• • 0 


0 






• 0 1 


-Mo 


Ml 


M2 • 


• • Mn — 1- 



If /z(X) = X n — jjL n iX n 1 — • • • — ijl 0 is any polynomial, then the 

n X n matrix given in (8) is called the companion matrix of ju(X). 
If, as in the present case, A is cyclic with minimum polynomial 
jit(X), then (8) is called the Jordan canonical matrix of A. Clearly 
the minimum polynomial may be read off from this matrix. 

We proceed now to derive a canonical matrix that displays the 
prime powers of /j. (X) . We prove first 

Theorem 3. Let 9i = \e\ and /x(X) = jui(X) • • • m«(X) where the 
M ,-(X) are relatively prime in pairs. Then 9? = {ei} ® {e 2 } © • • • 
® {e„\ where jx ex (\) = m(\). 

Let Vi(k) = m(X)(m»(X)) -1 and let e { = ev t {A). Then /^(X) = 
Mi(X). We know also that the order of e' = <? x + e 2 +•••+<?« is 
m(X). Hence {e'} = 9i. On the other hand, \e'} c {<? x } + { e 2 } 

+ • • • + {e s } so that 9? = {^ x } + {e 2 } H h {?»}. Since dim 9? 

= degju(X) = 2 deg Mi(X) = 2 dim {et}, the subspaces { 1 ?,} are 
independent. Hence 9i = {ei} © {e 2 ) © • • • © 

Now let jx{\) = 7r x (X) 4 V 2 (X)** • • • ir s (X)^ where the ir s - are distinct 
primes. Then by Theorem 3, 9i = {^ x } © {^ 2 } © ••• © {tf«} 
where M ei (X) = tt^X)**'. 
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For the sake of simplicity of notation we assume for the mo- 
ment that s = 1. Then p(X) = 7r(X)* where 7 r(X) is prime, and if 

tt(X) = X 9 - iX 9-1 p 0 , 

w = deg p(X) = kq. We introduce the vectors 

/1 = h = /, = ew(A) k ~ 1 A q ~ 1 ; 

f q+1 = eir(;A) k -\ f q+2 = ei r(A) k ~ 2 A y • • •, f 2q = e* (A) k ~ 2 A q -' ; 



/(*-!)«+! ^3 f(k— l)a+2 ‘3 /*A;g — £^ 9 1 . 

Thus each / is of the form e 4 >(A) where deg <£(X) < kq. Moreover, 
the degree of the </>(X)’s associated with distinct /’s are different. 
It follows that the /’s are linearly independent, since a non-trivial 
relation of the form = 0 entails the existence of a polyno- 

mial v(\) 7 * 0 of degree < kq such that ev(A) = 0. This contra- 
dicts the fact that deg p(X) = kq and establishes the linear inde- 
pendence of the /’s. Since there are altogether kqf s, C/ 1 ,/ 2 , • * *, 
/fcg) is a basis for 9 X over <3>. 

What is the matrix of A relative to C/ 1 ,/ 2 , • • - yfkq) ? We have 
the following relations: 



AA=f 2 



f%A=f z 



f q — \A — f q 

f q A = eir{A) k - l A q = eir{A) k ~ l [A q - 7 v{A)\ 

= 6Tr{A) k 1 [p 0 l + PiA d b p q -iA q ~ l ] 

= Po/l + Plf 2 H h Pq-l/q 

f q +lA =/ fl+ 2 

fq+2 A = /g+3 
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f2q-\A — ftq 

f 2q A = e*{A) k - 2 A* = eit{A) k ~ 2 [A « - 
+ er {A^- 1 



= P 0 / 4 +I + Pl/q+2 + • • • + Pq-lf2q + /l 



— \)q-\-\A f {k — 1)4 +2 



J \k—l)q+q—\A fkq 



fkqA — Po/(*_ 1) 9 + 1 + Pl/(fc-l)g+2 + ’ ’ • + Pq-lfkq 

l J '{k — 2)4 + 1* 

Hence the matrix of A relative to the basis (/ 1 , / 2 , • • • , fkq) has 
the form 



( 9 ) 



'P 

N P 



N Pi 



where P is the companion matrix of 7r(X) and N is the q X q 
matrix 



( 10 ) 



00 ••• 0 

0 0 ••• 0 



ll 0 



0J 



We return now to the general case in which p(X) = 7ri(X) t: 7r 2 (X) A:! 
• • • ir*(X) i# and 91 = {e^ © {e 2 \ © • • • © {e a }. We choose in each 
a basis of the type just indicated. Together these bases 
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give a basis for 9{. The matrix of A relative to this basis is 



(ID 



B 2 



BJ 



where each Bi is determined by tt i(X) kt in the same way that B is 
determined by ^(X)*. 

We shall call (11) the classical canonical matrix of the cyclic 
linear transformation A . The prime powers Ti(\) ki that occur in 
the factorization of the minimum polynomial will be called the 
elementary divisors of the cyclic linear transformation. To the 
elementary divisor 7 r^X)* 1 is associated the block Bi in the classi- 
cal canonical form. As an example, let $> be the field of rational 
numbers, and let ju(X) = (X — 1) 3 (X 2 — 2) 2 . The classical ca- 
nonical matrix of the cyclic linear transformation with minimum 
polynomial jii(X) is 



1 0 0 
1 1 0 
0 1 1 






0 1 


0 0 




2 0 


0 0 




0 0 


0 1 




1 0 


2 0 



EXERCISES 

1. Show that, if ju(X) is any polynomial with leading coefficient 1, then there 
exists a matrix (a) that has ju(X) as its minimum polynomial. 

2. Show that, if A is a linear transformation that has the diagonal matrix 

diag {«i, < 22 , • • • , a n | 

where the are distinct, then the minimum polynomial of A is II(X — oti). Hence 
prove A cyclic. 
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3. Use 2 to prove that, if C is the field of complex numbers and fi, fc, • • •, f n 
are the n distinct roots of unity, then 



Ti 






0 


1 


0 


... o' 






and 


0 


0 


1 


... o 








0 


0 


0 


• 0 1 








1 


0 


0 


... 0 . 



are similar in C n . 

4. Show that, if A is a linear transformation with matrix 



'0 


1 


«13 • 


• Ciln 


0 


0 


1 <*24 • 


• OL2n 


0 


0 


0 • • 


0 1 


0 


0 


0 • • 


0 0 



then A is cyclic. Use this to prove that the given matrix is similar to 



"0 


1 


0 




O' 


0 


0 


1 




0 


0 


0 


0 


• 0 


1 


.0 


0 


0 


• 0 


0 . 



5. What is the classical canonical matrix of the cyclic linear transformation 
whose minimum polynomial is (X — 1) 3 (X 2 + 1) 2 (X + l) 2 ? Assume to be the 
field of real numbers. 

6. Prove that, if 9i = {^ 1 } © { } © • • • © {e 8 } and the orders ^(X) are rela- 
tively prime in pairs, then is cyclic. 



5. The 3>[X]-module determined by a linear transformation. 

For the further study of a linear transformation we shall make 
use of another idea, namely, that the homomorphism between 
<£[X] and the ring 21 = [1, A, A : 2 , • • • ] can be used to turn the vector 
space 9? into a <£[X]-module. For this purpose we define the prod- 
uct </>(\)x y x in dt y to be the vector x<j>(A). Then 

0(X) 0 + y) = 0 +y)<t>(A) = x<j>(A) + y<t>(A) = <j>(X)x + </>(X)jy, 

and since the linear transformations corresponding to </>(X) + 
and </>(X)^(X) are respectively <i>(A) + \[/(A) and (t)(A)\f/(A ) = 
\l/(A)<t>(A), 
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[0(x) + iKx)]# = x[4 >(* 0 + 

= <t>(X)x -f- \f/(X)x 

fe(X)*(X)]* = x[f(A)4>(A)] = (xt(J)MJ) = 0(X)(*(X)*) 

1a: = a;1 = 

Hence the basic module conditions hold.* There are several ad- 
vantages to be gained in adopting the module point of view. In 
the first place we replace the ring 21 by the ring $[X], The for- 
mer may have quite a complicated algebraic structure; it may 
contain nilpotent elements and it may have zero divisors. On 
the other hand, we know that the polynomial ring <f>[A] is an in- 
tegral domain, and we have available the arithmetic theory of 
this ring. The basic arithmetic fact concerning $[A] is that every 
ideal in this ring is a principal ideal. 

A second advantage of the module point of view is that it 
leads us to the consideration of other <i>[\]-modules that are of a 
simpler structure than 9? and that can be used to study 9i. In 
fact we shall reduce our study of 9? to that of free modules and 
of submodules of free modules. This is accomplished in the fol- 
lowing way: Let (e ly e 2y • • •, e n ) be a basis for 9 t over <£. Then, 
of course, the ei form a set of generators for 9 1 relative to <£[X]. 
The <£[X]-module 9? is not a free module since for any e 8 9t there 
exists a non-zero polynomial /x e (A) such that Me(X)^ = e\x e {A) = 0. 

n 

Hence there exist relations of the form X) 7t(X)*» = 0 in which the 

i 

7i(X) are not all 0. In order to study these relations it is natural 
to introduce the free <L[X]-module g with basis (/i, t 2y • • •, t n ). 
If v = S0<(X)/»- is an element of g, then we associate with v the 
element vT = S0»(X)^ in 9L This correspondence is a 3>[A]- 
homomorphism of g on 9 l. Let 91 denote the kernel of this homo- 
morphism. It is immediate that 91 is a submodule of By defi- 
nition 27»(X)/f e 91 if and only if the relation 27 <(X)^< = 0 holds 

* It should be noted that the commutativity of multiplication in 21 was used in the 
verification of the third axiom. We can also regard 9i as a right <l>[X]-module. Here 
= x<f>(J). This is probably the more natural point of view. However, we have 
adopted the above method in order to be consistent with our previous emphasis on left 
modules. 
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among the generators Thus in a sense 91 corresponds to the 
set of relations that obtain among the 

We can easily determine the submodule 9?. Let eiA = 2a^y 
so that (a) is the matrix of A relative to the given basis. Then 
it is clear that the elements 



— (X — — a\2h a\ n t n 



( 12 ) 



v 2 — + (X — a 22 )t 2 



&2 An 



V n — —Oinlh ““ Oin2h — ‘ ‘ + (X — OL nn )t n 

are in 91. We shall now show that 9? is free and that the v’s form 
a basis for this module. The definition Vi = X/» — gives 

\ti = Vi + 2 oiijtj. These relations can be used to express any 
element v = 20;(A)/* in the form 2^(A)p i + 2 pf { where the 
Pi e <£. Clearly if v e% then 2 pfi e 9?. Hence 2p l -<? l - = (2p/,)T 
= 0. But the ei are ^-independent, and this implies that every 
Pi = 0. Hence v = 2i/^(A)^, which shows that the v { are genera- 
tors for 9Z. Suppose next that 2^(X)o* = 0. Then 

2 \pi(k)\ti = 'Zf\pi(X)aijtj 

and 

fc(x)x = Z 

J 

If any ^ ^ 0, let be one of maximum degree. Then clearly 
the relation ^ r (X)A = ^ ^Ay(X)ay r is impossible. This proves that 

3 

the A* form a basis for 9L 

6. Finitely generated o-modules, o, a principal ideal domain. 

There is still another advantage of the module theoretic method: 
It is easily generalized. In place of $[A] we can consider any 
principal ideal domain o and any finitely generated o-module $K.* 
With a slight increase in difficulty we shall obtain in this way 
other important applications. The most noteworthy of these is 
the theory of ordinary finitely generated commutative groups. 

* A good deal of what we shall do can also be done for 0-modules for which 0 is a non- 
commutative principal ideal domain. This theory can be applied to the theory of a single 
linear transformation in a vector space over a division ring. See the author’s Theory of 
Rings , New York, 1943, Chapter 3. 
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As we have noted in Chapter 1, this theory can be obtained by 
regarding commutative groups as o-modules for o, the ring of 
rational integers. 

Hence let 9? be any finitely generated o-module and let (ei> e 2y 
• • - y e n ) be a set of generators. Following the procedure that we 
used above we introduce the free module g with basis t 2 , • • *, 
t n ) and the mapping T: — » 2 <£^-. The mapping T is an 

o-homomorphism. Hence the kernel 91 is a submodule of g. We 
shall now show that any submodule of a free o-module, o princi- 
pal, is free. It is convenient to use the convention that the mod- 
ule 0 is a free module with a vacuous basis. 

Thus let 91 be any submodule of the free module g with the 
basis (/i, t 2y • • *, t n ) • We shall show by induction on n that 91 is 
free and has a basis of m < n elements. This is clear if n = 0. 
Hence we assume it true for modules g' with bases of n — 1 ele- 
ments. We consider now the totality 3 of elements /3 X of o such 
that Piti + £ 2^2 + • • * + PJn e 91. It is immediate that 3f is an 
ideal; hence 3 = (5i). If di = 0 every element of 91 is of the 
form P 2 t 2 + • • • + p n t ny and therefore it belongs to the free mod- 
ule g' with the basis (t 2y hy • • t n ). In this case the result is 

n 

clear. Now assume that Si ^ 0 and let Vi = 5i/i + ^ fi/tj be 

2 

an element of 91 that has the “leading coefficient” Si. Then if 
v is any element of 9t, v = 2/3*/* and Pi = pidi. Hence v' = 
v — fiiVi e g'. Of course this element also belongs to 91. Now the 
intersection 9K = 9t fl g' is a submodule of g', and so we can 
assume that it is a free module with the basis v 2y • • *, v my 

m 

m < n. Then v' = 23 and v = niVi + Hence ( Vi , 

k = 2 

m 

v 2y • • • , v m ) is a set of generators for 91. Suppose next that ^ P% v % 

1 

= 0. Then if we replace V\ by 5i/i + 2/3 //,• and the other v’s by 
their expressions in terms of / 2 , / 3 , • • • , t n we obtain 

P 1 S 1/1 + p 2 t 2 + • • • + Pm'tm = 0. 

n 

Since S x ^ 0, this implies that = 0. Hence we have ^ P^k 

2 

= 0 from which we conclude that all the p’s are 0 since the ele- 
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ments (v 2y v 3y • • •, v m ) form a basis for 9T. This concludes the 
proof of the following 

Theorem 4. If g is a free o - module with a basis of n elements 
and o is a principal ideal domain y then any submodule of g is free 
and has a basis of m < n elements. 

7. Normalization of the generators of g and of 31. We shall 
suppose now that the elements (v Xy v 2y • • v m ) are merely genera- 
tors for the submodule 31 of the free o-module g with basis (t ly 
t 2y • • •, /»). We write 



Vl — Vllh + <r I2h + • • * + <7 1 Jn 



(13) 



V 2 — (J 2 \t\ + O' 22 t 2 + • • • + o 2n t n 



V m = Gm\t\ + O m 2h + * * ‘ + GmJ n . 

The matrix (a) with elements Oij in o is uniquely determined by 
the ordered sets (v ly v 2y • • •, v m ) and (t ly t 2y • • •, t n ). 

Let (nij) be a unit in the matrix ring and let the inverse matrix 
be (vij*)- Define t/ = 2/z*/y. Then we assert that the t/ con- 
stitute another basis for g. In the first place we have the relations 

^rflki h = = ^&kjtj = tfc. 

id 

Hence any element 2<fo/ z - may be written in the form 2^// and 
the t/ are generators. Next suppose that Dyd/ = 0. Then 
2 7 Wifi = 0. Hence X) y my = 0. But this implies that 

id i 

yk = X^ y iflijV'jk* = 
id 

and this proves that the t/ form a basis. 

In a similar fashion we see that if (v pq ) is a unit in o my then the 
elements vf = 'Lv pq v q constitute a second set of generators of 91. 

Suppose now that the basis {t Xy t 2y • • - y t n ) is replaced by (//, 
t 2 y • • •, t n f ) where // = 2 (/z) a unit, and that the set of 
generators (v Xy v 2y • • •, v m ) of 91 is replaced by (vf y v 2 ' y • • - , vj) 
where v p f = '2v pq v q . What is the matrix of (vf y v 2 \ v m ') 
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relative to (tf, t 2 \ • • •, /»')? We have the relations 

Vp = yi VpqVq == ^ Vpq& qdi == ^ > VpqG qiHij^tj 

q q,i q,i,j 

— ^Tpjlj y 

where r pj = ^ v pq i T q nxij*. Thus the new matrix is related to 

q,i 

0) by 

(14) (r) = WOlKm) -1 . 

We are now led to the problem of making the “right’' choices for 
the units (?) and (/z ) -1 so that the matrix (r) has a simple “nor- 
mal” form. 

8. Equivalence of matrices with elements in a principal ideal 
domain. Two m X n matrices with elements in o are said to be 
equivalent if there exist a unit (/z) in o n and a unit (v) in o m such 
that 

M = WWW* 

This relation is an equivalence. The special case in which o is a 
field has been considered previously. Here we learned that a 
necessary and sufficient condition that (a) and (r) be equivalent 
is that they have the same determinantal rank. A consequence 
of this result for the general case of an arbitrary principal ideal 
domain is that a necessary condition for equivalence is equality 
of ranks. For let (r) = M(o-)(m), where (/z) and (v) are units. 
Then if P is the quotient field of o y then the given equation may 
be regarded as a relation between matrices with elements in P. 
Clearly (/z) eL(P, n) and (v) eI(P, m). Hence by the result in 
the field case (r) and (a) have the same determinantal ranks. 

We consider now the problem of selecting among the matrices 
equivalent to a given m X n matrix (<r) one that has a particularly 
simple form. The result that we wish to obtain is the following 

Theorem 5. If ( <r ) is an m X n matrix with elements in a prin- 
cipal ideal domain o, there exists a matrix equivalent to (a) which 
has the “diagonal” form ( cf . (18), p. 45) 

diag {Si, S 2 , • • •, S r , 0, • • • , 0} 

where the S ,• are 7 ^ 0 and 5,- 1 5 j (5» is a factor of Sj ) if i < j. 
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(a) We give first a constructive proof of this theorem which is 
valid for the case where o is Euclidean. We recall that a commu- 
tative integral domain is called Euclidean if there is defined a 
degree 8(a) for every a in the domain, such that 8(a) is a non- 
negative integer and such that 

1. 8(a) =0 if and only if a = 0. 

2. 8(ab) = 8(d) 8(b). 

3. If a is arbitrary and b ^ 0, then there exist elements q and 
r with S(r ) < 8(b) such that a = bq + r. 

We refer the reader to Volume I, p. 123, for the proof of the fact 
that any ideal in a Euclidean domain is principal. We proceed 
now with the proof of the theorem. 

We note first that the square matrices 



1 



J 



t^) = 



1 • • j8 • • • / 

1 



1 



1 



i 



Di(y) 




1J 




THE THEORY OF A SINGLE LINEAR TRANSFORMATION 81 



where / 5^ j and 7 is a unit in 0 and 



a 



t 



1 



Pij = 



0 . . . 1 . . . i 

• 1 



1 

1 • • • 0 

1 



j 



u 



are units; for TyOS) -1 = Tij(-fi), Z)i(/y) -1 = D*(/y -1 ) andP« -1 = 
Pij. It is easy to verify that 

I. Left multiplication of (a) by an m X m matrix Tij(( 3 ) gives 
a matrix whose /th row is obtained by multiplying the _/th row of 
(o-) by /3 and adding it to the /th row and whose other rows are 
the same as in (<r). 

Right multiplication of (<r) by an n X n matrix Ta(fi) gives a 
matrix whose ^th column is /3 times the zth column of (<r) plus the 
_/th column of (<r) and whose other columns are as in (<r). 

II. Left multiplication of (<r) by an m X m matrix Z),( 7) 
amounts to the operation of multiplying the /th row of {a) by 7 
and leaving the other rows unchanged. 

Right multiplication of ( a ) by an n X n matrix D t { 7) amounts 
to the operation of multiplying the /th column of (<r) by 7 and 
leaving the other columns unchanged. 

III. Left multiplication of (<r) by an m X m matrix P amounts 
to interchanging the /th and yth rows of (a) and leaving the re- 
maining rows unchanged. 

Multiplication of ( a ) on the right by an n X n matrix Pa 
amounts to the interchange of the /th and yth columns of (<r). 
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We refer to the matrices 7\y, D{> Pij as elementary matrices , 
and multiplication by these matrices as elementary transformations 
of the respective types I, II and III. Any matrix that is ob- 
tained from (cr) by a finite sequence of elementary operations is 
equivalent to (o'). We now proceed to the main part of the proof. 

If (o') = 0, we have nothing to prove. Otherwise let o^- be a 
non-zero element of least degree in (o'). Elementary transforma- 
tions of type III will move this element to the (1,1) position. If 
or k = criiPk + Pik where the degree 8(p lk ) < 5(ori), then we may 
use an elementary transformation of type I to replace 0*1* by p lk . 
If Pik 7 ^ 0, we have a new matrix equivalent to (<r) in which there 
are non-zero elements of degree less than the minimum non-zero 
degree in (a). We repeat the original process with this new ma- 
trix. Similarly if a kl = cr n /V + p k 1 where 8 (p kl ) < 5(ori), then 
either p k 1 = 0 or we obtain a matrix equivalent to (or) that has a 
non-zero element of degree less than the minimum 5(o’* ; ) 7^ 0. 
Since the minimum degree 7^ 0 is constantly decreasing, we even- 
tually reach a matrix in which (?i k = <rnj3& and <r k i = 0T1/V for 
all = 1, 2, Then elementary transformations of type I 

yield an equivalent matrix of the form 



(15) 



Pn 0 

0 P22 



0 



P2n 



P 11 = OTl* 



10 pm2 



Pmn J 



This process can now be applied to the submatrix (p*y) / = 2, 
• • • y m; j = 2, ••*,«. The necessary transformations do not 
affect the first row or column, and yield an equivalent matrix of 
the form 



>11 


0 


• 


... 0 - 


0 


T22 


0 


... 0 


0 


0 


^33 


’ ‘ * T3n 


.0 


0 


T m3 


T mn J 



Continuing in this way we obtain finally an equivalent matrix of 
diagonal form, say diag {ei, e 2 , • • *, e r , 0, • • •, 0} where the c» 7^ 0. 
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If ei = €ijS + pi with 8(pi ) < 6(ei) and p* 5 * 0, then we may re- 
place our diagonal matrix by the equivalent matrix 



€1 • • • ei 



ei 



*i + 1 



This is equivalent to a matrix in which the element in the (1, i) 
position is pi . We then repeat our earlier process to “diagonalize” 
this matrix. We obtain a new matrix diag {771, 772, ■ • • , 0 , • • • , 0} 

in which 6(771) < 5(ei). A number of repetitions of this process 
leads eventually to an equivalent diagonal matrix, diag {8 ly 5 2 , 
• • •, 5 r , 0, • • *, 0} in which the non-zero elements satisfy the 
required divisibility conditions. 

(b) The argument in the general case is similar to this. Here 
we use the length of an element, that is, the number r of primes 
7 Ti that occur on a factorization of the element as 7ri7r 2 • • • 7r r 
as a substitute for the degree used above. This number is an 
invariant because of the unique factorization theorem in 0 (cf. 
Volume I, p. 122). Suppose that an has smallest length for the 
non-zero elements and suppose that an /Y or*.* Write a = <rn, 
]8 = a i& and let 5 be a highest common factor of a and 0. Then 
there exist elements £, 77 such that a% + ^77 = 5. Set f = /3S -1 , 
d = — a5 -1 . Then we have the matrix relation 



1 




r 




1 0 - 






.77 6 . 




.0 1. 



Thus these matrices are units; consequently 

* a X b means “ a is not a factor of b .” 
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k 

£ o ••• o r o ••• 

0 1 



u = 



r i 0 



1 

0 Q • 

1 



k 



1J 



is a unit. The matrix ( a)U has as its first row the vector (5, a 12 , 
* 13 , • • •, *i(k-i ), 0, a Uk+i), • • •, o-in). since a/K/3, the length 
of 5 is less than that of a = <7n. Eventually this process and a 
similar one applied to the first column yield a matrix equivalent 
to (a) in which an is a factor of each a^ and of each o-&i. As 
before, this leads to a matrix of the form (15). The remainder 
of the argument is a repetition of the one given above. 

A matrix equivalent to ( a ) that has the form given in Theorem 5 
is called a normal form for the matrix (<r). The diagonal elements 
of a normal form are called invariant factors of (a). We shall 
show in § 10 that they are, in fact, invariants of the given matrix. 



EXERCISES 

1. Obtain a normal form for the matrix 

'6 2 3 Oi 

2 3-41 
-3 3 12’ 

.-1 2 -3 5. 

0 = 1 the ring of integers. 

2. Obtain a normal form for the matrix 

'X — 17 8 12 -14' 

, » -46 X + 22 35 -41 

(<7)= 2 -1 X-4 4 

. -4 2 2 X - 3- 
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0 = Z?o[X], Rq the field of rational numbers. Also find units Qz) and ( y ) such that 
WWW = normal form. 

3. Prove that a diagonal square matrix, diag {5i, £ 2 , • • •, 5 n ) is a unit if and 
only if every 5 t - is a unit. 

4. Use 3. to prove that, if 0 is Euclidean, then any unit in o n is a product of 
elementary matrices. 

5. Show that, if d is a greatest common divisor of the elements a\> a 2 , • • *, a n > 
then there exists a unit (j iz) in 0 n such that ( 01 , • • •, a n )(jj) = (^,0, • • •, 0). 

6. Show that, if the elements an , ^ 12 , • • • , i n are relatively prime (have 
greatest common divisor = 1), then there exist elements a a, i = 2, • • *, n,j = 
1, 2, •••,», such that (a) is a unit. 

9. Structure of finitely generated o-modules. In order to state 
the structure theorems it will be well to introduce some general 
concepts concerning o-modules. Some of these have already been 
encountered in the special case of modules determined by a linear 
transformation. 

Let x be any element of the o-module 9t. Then the set $ x of 
elements P e 0 such that (3x = 0 is an ideal. We call this ideal 
the order ideal of the element x. If 0 is a principal ideal domain, 
Sx = (mx)- Moreover, if 0 = <i>[X] and 9 ^ (0), then we may 
normalize /z x to have leading coefficient 1. The ix x thus obtained 
is the order that we have considered before. 

The totality {#} of multiples ax of the fixed element x in is 
a submodule. We call {#} the cyclic submodule generated by x 
and we call 9? cyclic if there exists an e such that 9t = {e}. If 
3x = (0), the submodule {#} is a free module. In the contrary 
case 3* 7^ (0) we say that x is of finite order . 

If y = ax and j8 e $ x , then Py = Pax = aPx = 0. Hence 
P e S y and so 3* C $ y . If y is a second generator of {#}, then also 
S y Q 3?*. Hence $ x = S y . Thus the order ideal depends only 
on the cyclic submodule and not on the particular generator of 
this submodule. 

A module 9 X is said to be a direct sum of the submodules 9fi, 

9 1 2 , ••■,81. if 

(16) 9i = 9Ji + 9?2 + * * * + 9ts 

in the sense that 9? is the smallest submodule containing all of 
the 9 hy and 

(17) St* n (9ti + • • • + 9?*_i + 3L+1 + • • • + dt s ) = 0 
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for / = 1, 2, • • •, j. If these conditions hold, we write 9? = 9? x 
® 0{ 2 © • • • © 0L- As in the case of a vector space 9J = 9^i ® 9? 2 
© • • • ® if and only if every x in 9? may be represented in one 
and only one way in the form x x + x 2 + • • • + where Xi e 0?;. 

We may now state the following fundamental structure theo- 
rem for o-modules, o a principal ideal domain. 

Theorem 6. If o is a principal ideal domain , any finitely gener- 
ated o - module is a direct sum of cyclic submodules . 

Proof. Let (Vi, e 2y • • *, e n ) be a set of generators for 0J. As 
above, we introduce the free module g with basis (t u t 2 > • • *, t n ) 
and the 0 -homomorphism T : 20*/* — > 20*** of g onto 3i. Let 01 
be the kernel. Then we have seen that 0J has a basis of m < n 
elements. For our purpose here we shall require only the weaker 
result that 01 is finitely generated. Let (sq, v 2y • • •, v m ) be a set 

n 

of generators and let = X i = 1 , 2, ■ ■ • , m, be the rela- 

i = l 

tions connecting the tfs and the t’s as before. We now replace 
the t’s by (//, / 2 ', • • •, / n 7 ) and the p’s by (fli', 0 2 ', * • *, vj) where 
// = 2/x*y//, p/ = 'ZvpqVq and (/x) and (?) are units. Then the 
matrix of the new set of generators (y/, v 2 \ • ••, v m ') of 0i relative 
to the new basis (//, t 2 \ • • •, t n f ) of g is (^) (cr) (/x) “ 1 . We choose 
the units (ju) and ( v ) so that (j')(<t)(m)~~ 1 has the normal form 
diag { 5 X , 8 2y • • •, 5 r , 0, 0, • • •, 0}. The relations connecting the 
new generators now read 

(18) V\ = 8\t\ y v 2 r = 5 2 / 2 ', * • *j 0 / = 5 r / r ', 

»r+i' = 0, • • •, = 0. 

Now define el = 2/z*y*y. Then since (m) is a unit these elements 
are also generators of 0J. We assert that 

(19) 9t = \e\ \ ® 2 } ffi * * * © { e n\ 

and that the order ideal of {*/} for / < r is (8»), while for / > r 
it is (0). Since the **' are generators, (16) holds. To prove (17) 

n 

we must show that, if ^ 0^/ = 0, then each /3 ,•<?/ = 0. Since 

i 

t/ = Z/7 7 = <?/. Hence 2/3^/ = 0 implies that 2 &// e ill. 
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Since the v/ are generators of this means that 52 £»// = 22 ypj- 

1 1 

Hence 

+ fish' + ' ' * + Pvfn = + * * * + 7r^/* 

Since the t/ form a basis for ft, this implies that 

Pi = y ibi for i < r and Pi = 0 for / > r. 

Thus certainly P^/ = 0 for / > r. Moreover, since v/ e 9t and 

v/ = hit/ for i < r, v/T = h { e/ = 0. Hence also P^/ = y = 

0 for i < r. This proves our first assertion and completes the 

proof of Theorem 6. 

We have also shown that Ptf/ = 0 for i > r implies that Pi = 0. 
Hence the order ideal 3 e/ = (0). Moreover, we saw that, if 

1 < r , then P#/ = 0 if and only if Pi is a multiple of hi. Hence 
for these i s, = {hi). This proves the second assertion made 
above. 

There is another interpretation of our results that we now give. 
We observe first that a direct sum of cyclic submodules such as 
{e r +i}, {e r + 2 / }, • • {e n '} in which the generators e/ have order 

ideal = (0) is a free module. For if p r +ie r +i + P r + 2 e T +2 + • • • 
+ i 8 n e n ' = 0, then each Pje/ = 0 and since 3v = (0), Pj = 0. 
We note next that the subset © of dt of elements of finite order 
is a submodule. For if yu J 2 8 © and P%y% = 0 for Pi ^ 0, then 
Pip 2 {yi — ^ 2 ) = 0 and P 1 P 2 7 ^ 0. Also if y e ©, then ay e © for 
any a. We shall now prove that the submodule © = {e/} © 
{ e 2 } ® • * * ® {e/} that consists of all linear combinations a^e/ 
+ cx 2 e 2 + • • • + oi r e T f is precisely the submodule © of elements of 
finite order in 9i. Since each e/ , i — 1, 2, • • *, r, is in ©, © C ©. 
On the other hand, let y = a x e/ + a 2 e 2 f + * • * + a n e n ' e ©. 
Then there exists a p 7 * 0 such that 

P&iei + Pa 2 e 2 + • • • + Poi n e n ' = 0 . 

We have seen that this implies that Paj = 0 for j > r. Hence 
aj = 0 for j > r and ye©. Thus © = ©. Now it is clear that 
91 = ©® {e r +i} © • • • ® {e n '} = ©©©where® = {e r+ i} © • * • 
© {e n '} is free. This proves the following 
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Theorem 7. If o is a principal ideal domain , any finitely gener- 
ated o- module is a direct sum of the submodule of elements of finite 
order and a free module . 

We note finally that in the decomposition 9? = © {ef} 

® * * * © {ef} we may drop the terms { e /} whose order ideals 
are (8j) = (1). If there are h such terms, we write en+f = f Jy 
j — 1 , 2, ••*,/, where / = n — h. We shall also denote the num- 
ber r — h of f’s of finite order by u and the number n — r of the 
remaining f’s by v. Finally we change the notation 8 h+J - to 8j. 
Then 

(20) 9? = {/i} © {7*2} © • * * ® {/*} 

where 

( 21 ) 3 *= («,■), j = 1 , 9 fh = ( 0 ), 

k = u + 1 , • • *, t and dj | 5j> if j <j'. 

For the case of an ordinary commutative group our results 
specialize to the following theorems. 

Theorem 8. Any finitely generated commutative group is a di- 
rect sum of cyclic groups . 

Theorem 9. Any finitely generated commutative group is a di- 
rect sum of a finite group and a free group . 

The finite group is © = {/ 1 } © {/ 2 } © • • • © {A}. Its order 
is 8i8 2 • • • 8 U if 8j is normalized to be positive. 

10. Invariance theorems. In this section we prove that the 
order ideals (5x), (5 2 ), • • (8 U ) and the number v = t — u of/’s 

in the free part [f u + 1 } © [fu+ 2 } ©•**©{/<} are invariants. 
We discuss first some more ideas concerning o-modules which 
will be needed for the proof. 

Let 9t be an o-module and let © be a submodule and 9?/© the 
factor group of cosets * + ©. If a e 0 and x + © e 9?/©, we de- 
fine a(x + ©) = ax + ©. Then since © is closed under scalar 
multiplication, the result ax + © given here does not depend on 
the choice of the representative x in the coset x + ©. It is 
readily seen that 9?/© becomes an o-module relative to this defi- 
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nition of scalar multiplication. The proof is the same as for vec- 
tor spaces (cf. § 9, Chapter I). We note also that, if 9i is finitely 
generated with generators e iy e 2y • • e ny then 91/© is finitely gen- 
erated with generators e x + ©, e 2 + • • *, e n + ©. 

We suppose next that 53 is an ideal in o having the property 
that fix = 0 for all /3 e 33 and all x e 8?. Let a = a + 33 be a 
coset in the difference ring o = o/33. Then we define ax = ax . 
If a = a l5 then a — a x = j8 e33. Hence a:# = («i + j3)tf = 

Thus o ' at is a single-valued function of a in o and x in 9L It is 
easy to verify that 9? is an o-module relative to this function. 

We give now the invariance proof which we shall break up into 
several stages. First let 9? = g be a free module with basis (t ly 
t 2y • • •, t n ). Let 91 be a submodule and suppose now that (tq, 
u 2y ••*, v m ) represents a basis for 91. Then the new elements 
t/ y Vp used in the proof of Theorem 6 form bases for g and 91 
respectively. We have the relations (18) connecting these ele- 
ments. Now since no element of a basis can be 0, clearly r = m 
in (18). Hence the number of t’s is not less than the number of 
y’s. Consequently m < n. If 91 = g, we may reverse the roles 
of the t r and the v' and prove that m = n. This proves that the 
number of elements in any basis of a free o-module, o a principal 
ideal domain is an invariant. As for vector spaces we call this 
number the dimensionality of the free module. 

Assume next that every element of 9t = © has finite order. 
Let 

©= {/l'}©{/ 2 '} ©•••©{//} 

be a second decomposition into cyclic submodules such that 
3// = ($/) ^ (0) and 5/ | dk for j < k. We wish to prove that 
t = t' and that (Sy) = (5/) for j = 1, 2, •••,/. We define the 
length of a decomposition © = {/i} © 1 / 2 } © * • • ® {ft) to be 
H,Si y where is the length of 5*. We suppose that the first de- 
composition has minimum length, and we shall prove the theo- 
rem by induction on this minimum length. 

Let 7r be a fixed prime and let ©' denote the subset of elements 
y in © such that iry = 0. ©' is a submodule of ©. Suppose that 
y — 'Ey if i e ©'. Then Eiry if i = 0. Hence irji is divisible by §;. 
This implies that either Si 1 y i or x | 5*. In the former case y if% 
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= 0, and in the latter is a multiple of Thus y 

t 

has the form Pj e jfh where h is the smallest integer such that 

h 

8h is divisible by 7r. This shows that 

© = { e hfh} © { Gh+lfh + 1 } © ' * * © 

Similarly, 

©' = {€*//*/}© •••©{ 6 ^/}. 

Since every element of ©' satisfies the equation Try = 0, ©' may 
be regarded as an 6-module, 5 = o /(x). Since t r is prime, we 
know that o is a field. (Vol. I, Ex. 2, p. 122.) Hence ©' is a 
vector space over 6. Hence by the theorem on invariance of 
dimensionality (or by the above result for free modules) the num- 
ber of base elements in the two decompositions is the same. 
Thus t — h = t' — h f . 

Now choose tt to be a divisor of Si. Then h = 1 and the equa- 
tion t — h = t r — h r shows that t' > t. Similarly if t r is chosen 
to be a divisor of 5/, then we obtain t > Hence t = We 
now see that, if 7r is chosen to be a divisor of 5i, then h — 1 im- 
plies that h! — 1. Thus t r is also a divisor of 5i'. 

We consider next the submodule i r®. The argument just used 
shows that 

(22) TT© = {tt/ 4 © {7r/fc + i} © • • • © {tt/*} 

= { 7 r/A:' / } © {Ttfk'+l] © * * * ffi Wft] 

where k and k f are respectively the smallest integers such that 
8 k and 8 k > are not associates of w. Then the order ideals of wfi 
are (ei) and those of t rfv' are (e^'), and the ex and the ex / satisfy 
the divisibility conditions. We can now use the induction hy- 
pothesis on the length of the module to conclude that t — k = 
t — k' and that (e*) = (e*/), (e^+i) = (e^+i'), * * *• These rela- 
tions imply that also (8i) = (5/), (8 2 ) = (d 2 '), • • This proves 
the invariance of the (8 t ) for the module ©. 

We consider finally the general case. Then if © is the sub- 
module of elements of finite order, the result just proved shows 
that the number u of fi of finite order is the same as the number 
u r of fi of finite order. Moreover, we have (S*) = (8/) for the 
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corresponding order ideals. Now consider the module 9?/©. It 
is easy to see that this module is free with basis f u +i + ©, • • •, 
ft + © and also with basis f u +i + ©, • • *,// + ©- Hence by 
the result proved for free modules, t — u = t r — u. This com- 
pletes the proof of the following 

Theorem 10. Let 3? = {/j} ©•••©{/«} = {fi} © • • • © 
[ft’’} be two decompositions of an o-module, o a principal ideal 
domain , into cyclic modules ^ 0. Assume that the order ideals 
satisfy conditions (21) and analogous ones for the //. Then t = /' 
and %. = / = 1, 2, •••,/. 

The module theorem just proved can be used to prove the 
uniqueness up to unit factors of the invariant factors of a matrix. 
Thus we have the following 

Theorem 11. If diag { 5 X , 5 2 , • • *, 5 r , 0, • • •, 0} and diag { Si', 
5 2 ', ' ' y 0, ••• ) 0} are equivalent m X n matrices with elements 
in a principal ideal domain o and 8* | Sjfor i < j, 8f | 8/ for k < /, 
then r = r' and 8i is an associate of 8/ for i = 1, 2, • • *, r. 

Proof. Let g be a free module with basis (t iy hy * * y tf)y and 
let 91 be the submodule generated by V\ = Si/i, • • *, v r = 8 r t ry 
v r +i = 0, • • •, v m = 0. Then 5/91 is a direct sum of cyclic mod- 
ules whose order ideals are (8x), ( 8 2 ), • • *, (5 r ) and a free module 
of n — r dimensions. On the other hand, the assumption of 
equivalence implies that we can find a new basis (//, / 2 ', • • •, t n ') 
for 9? and new generators vf = Sftfy vf = 8 2 f t 2 f y ••*, v r *' = 
8 r // r /, Vr'+f = 0, • • *, vA — 0 for 9L This gives a decomposi- 
tion of as a direct sum of cyclic modules with order ideals 
(8 f)y (8 2 '), • • *, (8 r /) and a free module of n — r r dimensions. 
Since the divisibility conditions hold, we conclude from Theo- 
rem 10 that r = r' and (5*) = (8/) for the (8*) 9 ^ (1). This im- 
plies ( 8i ) = (5/) for all i. 

We shall give next a second and purely matrix proof of this 
result. At the same time we shall obtain some useful formulas 
for calculating the invariant factors. 

We observe first that, if ( v ) is any m X n matrix, then the rows 
of ( y)(a ) are linear combinations of the rows of (<r). Hence for 
any j y the /-rowed minors of are linear combinations of the 
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j-rowed minors of (<r). Similarly the 7-rowed minors of (o-)(/z) 
are linear combinations of those of (or). Combining these two 
results we see that the j-rowed minors of (v)(<r)(/i) are linear 
combinations of those of (<r). Now let Ay(<r) be a highest com- 
mon factor of the j-rowed minors of (</). Then our result shows 
that, if (r) = (y)(<r)(/i), then Ay(<r) | A y(r). If (f) and (v) are 
units, then also Ay(r) | Ay(<r). Hence Ay(r) and Ay(o-) are asso- 
ciates. We apply this now to the equivalent normal forms 

diag {5 l5 S 2 , ■ ■ ■, 8 r , 0, ■■ ■, 0} 

diag { 5/, 8 2 ', • • •, 5 r / } 0, • • •, 0}. 

Denote the Ay for these matrices by Ay and Ay' respectively. 
Then because of the divisibility conditions on the 5’s and the 
S'’s, we may take 

Ay = M 2 •••«,• and Ay' = 8f8 2 f • • • 8/. 

Since Ay is an associate of Ay' for each j , it follows that r = r'. 
Also Ai = di is an associate of A/ = 5/. Since 8 X 8 2 is an asso- 
ciate of 8i'd 2 ' we also have that 8 2 and 6 2 ' are associates. Con- 
tinuing in this way we see that 5* and 8/ are associates for each 
/. We have also proved the following 

Theorem 12. Let A y(<r) be a highest common factor of the j-rowed 
minors of (a) and suppose that Ay(cr) ^ 0 for j ^ r. Then the 
elements 8 X — Ai(<r), 8 2 = A 2 (a-)Ai(o’) -1 , • • • y 8 r = A r (o’)A r _i((7)“" 1 
constitute a set of invariant factors for (<r). 

11. Decomposition of a vector space relative to a linear trans- 
formation. We return now to the consideration of a linear trans- 
formation A in a vector space 9t over <£. We apply the above 
results to the <£[X]-module 9? determined by A . Since every vec- 
tor * has an order /z*(X) ^ 0, dt = {/i} © {/ 2 } © • • • © {ft} 
where Q fj = (5y) 9* (0), 7^ (1) and 5y | 8 h if j < h. The invariant 
factor ideals (5y) are uniquely determined. 

If (e\y e 2y • • *, e n ) is a basis for 9t over <l> and eiA = 2a*y£y, 
then ( a ) is the matrix of ^ relative to this basis, and the elements 
Vi = IZaijtj — X/y form a basis for the kernel 9i of the homomor- 
phism T between the free module g and 9 1 Hence the matrix 
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that must be normalized to obtain the invariant factor ideals is 



( 23 ) (a) = XI - (a) 



X — an —« 12 • * * — am' 

— a 2 i X — <*22 * * * — a 2n 

a n i a n 2 * X a nn . 



The normal form of this matrix is 



(24) diag {1,1, • • •, 1, Si, 5 2 , • • •, 

where it is assumed that the leading coefficient of S* = S,-(X) is 1. 
Our results show us also how to obtain a set of /* s such that 9? = 
{/i} © {/ 2 } © * * * © {ft}- If (m) and (y) are units such that 
(?) (< r)(/i) is the normal form, and e{ = 2j Uij*ej, (/**) = (m)“S 
then we may take /* = e n -t+i* 

We obtain a <£-basis for 9i by stringing together <i>-bases in the 
cyclic subspaces {/,•}. If the degree of hi is n iy then (fi,fiA y 
• • •j/i/f* -1 ) is a basis for {/<}. Hence 



is a basis for 9i over <i>. The matrix of ^ relative to this basis 
has the form 

B 2 



(25) 



AJ 



where the diagonal block Bi is the companion matrix of 5*-(X). 
The matrix (25), which is completely determined by the invariant 
factor ideals, is called the Jordan canonical matrix of A . 

A more refined canonical form can be obtained by applying 
the following considerations. Let 

ii(X) = TrtCX^WX)** 1 • • • ^(X)* 1 * 

be the factorization of 5,- into powers of distinct prime factors. 
Then by Theorem 3, p. 70, {/»•} = {fn) © {fa) ® • • • © {/«><} 
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where the order of fij is Wij(\) kii . We choose now a basis in each 
{fij} of the type given on p. 71. These bases together give a 
basis for dt and the matrix of A relative to this basis is 



(26) 




C t \ 



where C* has the same dimensions as Bi in (25) and 



(27) 



C, = 



Cn 



Ci2 



where 



(28) 



'P« 

Nij Pij 



C iSi ) 



kij blocks 






and P^ is the companion matrix of 7r^(X) and has the form 
(10). The matrix thus obtained will be called a classical canonical 
matrix of A. It displays the prime factors 7r^(X) and the expo- 
nents of these primes in the factorization of the Si( A). The ideals 
( Vij(\) k 0 will be called the elementary divisor ideals of A . Also 
the polynomials 7r;y(A)* 17 will be called the elementary divisors of 
(a) = A1 - (a). 

For example, suppose that the invariant factors ^ 1 of Al — (a) 
are A 3 - A 2 - A + 1 = (A - 1) 2 (A + 1) andA 6 - 3A 4 + 3A 2 - 1 
= (A — 1) 3 (A + l) 3 . Then the Jordan canonical matrix is 
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0 1 

0 0 

-1 1 



0 

1 

1 



0 


1 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


1 


0 


0 


0 


0 


0 


0 


1 


1 


0 


-3 


0 


3 


0 



The elementary divisors here are (X — l) 2 , (X + 1), (X — l) 3 , 
(X + l) 3 . Hence a classical canonical matrix is 



1 0 
1 1 



1 


0 


0 


1 


1 


0 


0 


1 


1 



-1 0 0 

1 -1 0 

0 1 -1 



To illustrate the method of obtaining the canonical matrices, 
let A be the linear transformation such that 

<?i A = — e\ — 1e 2 + 6e 3 

e 2 A = — e x + 3e 3 

e 3 A = — e\ — e 2 + 4^ 3 . 
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The matrix (a) here is 



-1 

-1 


-2 

0 


6 

3 


and XI — (a) = 


A + 1 
1 


2 -6 
A -3 


.-1 


-1 


4- 




1 


1 A - 4. 



We have 



0 


1 0 




1 


3 


-3 + A’ 


0 


-1 1 


[A1 - («)] 


0 


0 


-1 


.1 


— X -f - 2 — 3- 




.0 


1 


-1 



Thus 



and 



00 = 









1 




0 




0 






= 


0 




A - 1 




0 








0 




0 


(A - 


- I) 2 - 


1 


3 — 3 -f~ A 






1 


A 


-3' 




0 


0 -1 


> (m)- 1 = 




0 


-1 


1 




.0 


1 -1 






.0 


-1 


0. 




= 


e x J - A ^2 — 3e 


3 &1 ~f“ 


— 


3*3 = 


0 





e 2 ' = —e 2 + <?3 



ez! = —e 2 . 



To obtain the Jordan matrix we use the basis f x = e 2 ,f 2 = ez , 
fz — ez A = e x — 3* 3 . Hence the Jordan matrix is 



1 






0 1 

-1 2 



The matrix that transforms (a) to this matrix is the matrix of 
relative to (e x , e 2 , e 3 ) and this matrix is 

o -i r 
o-i o • 
ll 0 -3. 

We can check that 



0 


-1 


r 




-1 


-2 


6 ‘ 




0 


-1 


r 


—1 


1 


0 


0 


0 


-1 


0 




-1 


0 


3 




0 


-1 


0 


= 


0 


0 


1 


.1 


0 


- 3 . 




.-1 


-1 


4 . 




.1 


0 


- 3 . 




.0 


-1 


2 . 
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We consider now the classical canonical matrices for the case 
where $ is the field of complex numbers, or, more generally, any 
algebraically closed field.* Here the only irreducible polynomials 
of positive degree are the linear ones. Hence the elementary di- 
visors have the form (X — p) k . The block (C<y in (28)) corre- 
sponding to this elementary divisor is 



(29) 



P 

1 P 



k. 



1 pJJ 



The classical canonical form has blocks of this type strung down 
the main diagonal. 

Suppose next that <£ is the field of real numbers. In this 
case the irreducible polynomials of positive degree are the linear 
ones and the quadratic ones X 2 — j8X — y where fi 2 + 4y < 0. 
The elementary divisors are of the forms (X — p) k > (X 2 — /3X — y) k . 
The block corresponding to (X — p) k is (29) and that correspond- 
ing to (X 2 — — y) k is 



(30) 



0 1 
y P 




0 0 
1 0 


0 1 
y P 



0 0 


0 1 


1 0 


y P 



k blocks. 



* A field is said to be algebraically closed if every polynomial with coefficients in the 
field has a root in the field. An equivalent definition is that every polynomial with co- 
efficients in the field factors into linear factors. 
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EXERCISES 

1. Find a classical canonical matrix similar to 



• 17 


-8 


-12 


14' 


46 


-22 


-35 


41 


-2 


1 


4 


-4 


- 4 


-2 


-2 


3. 



and find a matrix that transforms this matrix into the canonical matrix. 

2. Prove the following theorem: 

A necessary and sufficient condition that two matrices (a) and (0) in be similar 

is that XI — (a) and XI — (/3) have the same invariant factors in <£[X] n . 

3. Prove that any matrix is similar to its transposed. 

4. Given the following elementary divisors: (X — l) 3 , (X — 1), (X 2 + l) 4 , 
(X 2 + l) 2 , (X 2 + 1), (X + 2), what are the invariant factors? 

5. Prove that if / is any vector whose order is the minimum polynomial of A , 
then there exists an invariant subspace © (© )A Cl ©) such that 9? = {/}©©. 

6. Let © be an invariant subspace such that the invariant factors of the trans- 
formations induced by A in @ and in 9 ?/© together give all the invariant fac- 
tors of A . Prove that there exists a second invariant subspace U such that 

dt = © © u. 

12. The characteristic and minimum polynomials. Again let 
9J = {/i} © 1 / 2 } © * * • © {ft} where the order ideals of the /* 
are the invariant factor ideals (Si). Thus the order of /*• is S{ 
and Si | Sj if i < j . We know that the minimum polynomial 
/x(X) of A is the least common multiple of the orders of the gen- 
erators /*. By the divisibility conditions this l.c.m. is 8 t (\). 
Hence /z(X) = 5*(X). 

Now let (a) be any matrix of A. We know that, if A n (X) = 
det (XI — (a)) and A n _i(X) is the highest common factor of the 
(n — l)-rowed minors of XI — (a)> then the following relations 
hold 

(31) A n (X) = 5i(X)5 2 (X) • • • 8 t (\) 

(32) M (X) = S t (X) = A n (X) [A n (X)] —1 . 

The polynomial A n (X) is called the characteristic polynomial of (a) 
(or of A). If we refer to (23) we can see that 

(33) A W (X) = X” - afk*- 1 + a 2 \ n ~ 2 + (-1 ) n a n 

where ai is the sum of the diagonal minors of order i in (a). 
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Of particular importance are the first and last as. These are 

(34) ai = an + «22 H b ot nn , oi n = det (a). 

The former is called the trace of (a). The main properties of this 
function of (a) will be considered in § 14. 

Now we know that, if j'(X) is any polynomial such that v{A) 
= 0, then v((a)) = 0. Also if ju(X) is the minimum polynomial 
of A, then /x(X) is the minimum polynomial of (a). Hence our 
results may be stated as the following theorem on matrices. 

Theorem 13. Let (a) be a matrix in <$ n and let A n (X) = det (XI — 
(a)) and /x(X) = A n (X) [A n _i(X)] _1 where A n _i(X) is the highest 
common factor of the {n — 1 )-rowed minors of X\ — (a). Then 1) 
A n ((a)) = m((<*)) = 0; 2) if v(X) is any polynomial such that 
p((a)) = 0, then ju(X) | *>(X); 3) ju(X) and A n (X) have the same prime 
factors. 

The first two statements are clear from what we have proved 
about A. The last statement follows from (31) and the fact that 
all the SfX ) are factors of S*(X) = mW- 
Theorem 13 is a composite of the theorem of Hamilton-Cayley 
on the characteristic polynomial and Frobenius’ theorem on the 
minimum polynomial. Direct matrix proofs of these results will 
be given in the next section. 

EXERCISES 

1. Prove that A is cyclic if and only if XI — (a) has only one invariant factor 

7 ^ 1 . 

2. Prove that, if (a) is nilpotent, then the invariant factors of XI — (a) all have 
the form X m . Hence prove that any nilpotent matrix is similar to a matrix of 

the form * T 

■Ni 

N 2 



where Ni has the form 



0 

1 



Ni = 



NqJ 



1 oJ 
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3. Prove that a matrix with elements in the field of complex numbers is simi- 
lar to a diagonal matrix if and only if its minimum polynomial has no multiple 
roots. 

4. Show that, if (a) is idempotent, then the elementary divisors of XI — (a) 
are either X or X — 1. Use this to prove the result on p. 62 for the case A a 
field. 

5. Show that, if u is a vector ^ 0 such that uA = pu , then p is a root of the 
characteristic polynomial. Conversely, show that, if p is a root belonging to <i> 
of the characteristic polynomial, then there exists a vector u 0 such that uA 
— pu. 

6. A vector u ^ 0 such that uA = pu is called a characteristic vector of A. 
Prove that such vectors always exist if <£ is an algebraically closed field. Prove 
also that, if is the field of real numbers, then any linear transformation in an 
odd dimensional space 9J over $ possesses characteristic vectors. 

13. Direct proof of Theorem 13. From the point of view of 
matrix theory the proof that we have given of Theorem 13 is 
somewhat roundabout. In this section we shall generalize the 
results contained in this theorem, and we shall give direct proofs 
of these results. We suppose first that o is any commutative 
ring with an identity. Let o[X] be the polynomial ring in the in- 
determinate X and consider the matrix ring o[X] n . This ring con- 
tains the subring o n of matrices with elements in o. Also it con- 
tains the matrix 

XI = diag {X, X, • • • X} 

which evidently belongs to the center. Now the essential ob- 
servation for our purposes is that o[X] n = o n [Xl] and XI is tran- 
scendental relative to o n . To see this, let (a(X)) be an arbitrary 
matrix in o[X] n and write 

(35) = QCijo + Otijik + e*;y 2 X 2 + • • • . 

Then if we recall that the product (a)Xl is obtained by multiplying 
all of the elements of (a) by X, we see that 

(36) (<*(X)) = (a) 0 + (a)iXl + (a) 2 (Xl) 2 H 

where ( a) k is the matrix that has in its (i,j) position. Next 

suppose that (a) 0 + (a)iXl + (a) 2 (Xl) 2 H =0. Then the 

(i y j) element of the left-hand side is given by (35). Since this 
element is 0, — 0. Hence every (a)* = 0, and this proves 

the transcendency of Xl relative to o n . 
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If 

</>(X) = 0o + 0i^ + 0 2 X 2 + * * * 

is a polynomial in o[X], then by </>((a)) we shall understand as usual 
the matrix 

/Sol + 0i(«) + 0 2 (a) 2 + • • • 

where in general 0 ;(t) is obtained by multiplying all the elements 
of (7) by 0*. Thus </>((a)) is obtained by replacing XI by (a) in 

0(X)1 = 0ol + 0!(X1) + 0 2 (X1) 2 H . 

If (a) eo n we define the characteristic polynomial A n (X) to be 
the polynomial det (XI — (a)) belonging to o[X]. We shall now 
prove the following 

Theorem 14. (Hamilton-Cayley) If {a) e o n , 0 a commutative 
ring with an identity , and A n (X) is the characteristic polynomial , 
then A n ((«)) = 0. 

Proof. We recall the identity (Vol. I, p. 59) 

(37) [XI - (a)] adj [XI - (a)] = det (XI - (a))l = A n (X)l. 

The matrix adj [XI — (a)] eo[X] n = o n [Xl], and the degrees of its 
elements are <n — 1. Hence 

adj [XI — (a)] = (0)o(Xl)— 1 + (0)i(Xl) n “ 2 + • • • + (0) w -i. 

Also if A n (X) = X" - a{k n ~ l + • • • + (-l) n a n , then 

A n (X)l = (XI)” - a 1 (Xl) n “ 1 + ••• + (-l)W. 

Hence the identity (37) is equivalent to 

(38) [XI - (a)][(0) o (Xl) n—1 + (0) 1 (XI ) n 2 + • • • + (0) n _d 

= (Xl) n - a x (Xl) w - ] 1 + ••• + (— T) n a n l. 

Thus we see that XI — (a) is a factor of A n (X)l in o n [Xl]. By 
the factor theorem (Volume I, p. 99) this implies that 

(a) n - ^(a)"- 1 + ••• + (-1 ) n a n \ = 0 

as required. 

We suppose next that 0 is a Gaussian integral domain, that is, 
a commutative integral domain with an identity in which the 
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unique factorization theorem holds. For such domains we have 
the following 

Theorem 15. (Frobenius) Let (a) e o n , o a Gaussian domain , 
and let A n (\) = det (XI — (a)) and /x(A) = A n (A)[0(A)] -1 where 
0(A) is the highest common factor of the (n — \)-rowed minors of XI 
— (a). Then 1 ) /*((<*)) = 0 , 2 ) if v(\) e o[\] and v{fotf) = 0 , then 
j u(X) | p(X) in o[A], 3) any irreducible factor of A n (X) is a factor of 
m(X). 

Proof. The existence of 0(A) is assured by the fact that o[A] is 
a Gaussian domain. Since some of the (n — l)-rowed minors 
(e.g., the diagonal minors) have leading coefficient 1, we may 
suppose that 0(A) has leading coefficient 1. Clearly 0(A) is a fac- 
tor of A n (A) since it is a factor of all the {n — l)-rowed minors. 
The quotient ju(X) has leading coefficient 1. Now let (7(A)) de- 
note the matrix in o[A] n that is obtained by dividing out the ele- 
ments of adj [XI — (a)] by 0(A). Then by ( 37 ) we have 

( 39 ) [XI - (a)]( 7 (X)) = m(X) 1 . 

By the argument used in the preceding proof, this relation im- 
plies that /x((a)) = 0. This proves 1). 

Now let m*(X) be a polynomial ^ 0 of least degree such that 
M*((<*)) = 0. We may assume that ju*(A) is primitive. Suppose 
that v(k) is any polynomial such that v((a)) = 0. If P is the 
quotient field of o, we may write 

KX) = ?(X)m*(X) + r(X) 

in P[X] where deg r(X) < deg m*(X). Multiplication by a suitable 
element rj 5* 0 in 0 gives a relation 

^(X) = ?1 (X) M *(X) + r x (A) 

where ^i(X) = ^(A) and rfX) = rjr(k) eo[A]. Substitution of (a) 
in this relation gives ri((a)) = 0. Hence by assumption of mini- 
mality for the degree of /z*(A), 77(A) = 0. Thus | 

Since /x*(A) is primitive, m*(X) | v(X). In particular, /x*(A) | m(X), 
and so, since the leading coefficient of ju(A) is 1, we may suppose 
that /z*(A) has this property too. 
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We now write p(A) = p*(A)p(A). Since /**((«)) = 0, by re- 
versing the argument used to prove 1), we see that 

M*00 1 = [XI - (a)](*(X)) 

where (5(A)) e o[A] n . Hence 

A n (A)l = p*(A)p(X)0(A) 1 = [XI - (a)](5(A))p(A)0(A)l. 

If we compare this with (37) and use the fact that XI — (a) is 
not a zero-divisor * in o[X] n , we obtain 

adj [XI - (a)] = (S(X))p(X)0(X)l. 

Thus all the {n — l)-rowed minors of XI — (a) are divisible by 
p(A)0(A). Since 0(A) was assumed to be the highest common fac- 
tor of the ( n — l)-rowed minors, it follows that p(X) = 1. Thus 
/a (A) = m*(X). Statement 2) now follows from what we proved 
above. 

To prove 3) we take the determinants of both sides of (39). 
This gives 

A„(X) det ( 7 (X)) = [ M (X)r. 

Statement 3) is an immediate consequence. 

EXERCISE 

1. Show that the characteristic and minimum polynomials of 



Oio 


Oil 


«2 


as' 


— Oil 




—a s 


a 2 


— Oi2 




ao 


— Oil 


-—Oi3 


—a 2 


ai 


ao - 



are, respectively, 

(A 2 — 2a 0 A + (ao 2 + m 2 + a 2 2 + « 3 2 )) 2 

and 

A 2 — 2a 0 A + ( ao 2 + oil 2 + a 2 2 + as 2 ). 

14 . Formal properties of the trace and the characteristic poly- 
nomial. We have defined the trace of (a), tr (a), to be the nega- 



Its determinant is A n [X] 0. 
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tive of the coefficient of X n_1 in the characteristic polynomial of 
(a). From this it follows that 

(40) tr (a) = an a 2 2 + * * * + aw 

If the characteristic polynomial A n (A) = II(\ — pi) in some ex- 
tension field, then the p* are called the characteristic roots of (a) 
in this field. The coefficients of A n (\) are the so-called elementary 
symmetric functions of the p;. In particular 

(41) tr (a) = pi + P2 + P3 + * * * + Pn« 

From (40) it is clear that the trace function is linear: 

(42) tr [(a) + (0)] = tr (a) + tr (0). 

tr (p(a)) = p tr (a). 

We can also verify the following property 

(43) tr (a)(0) = tr (0)(a). 

For the (z, i) element of (a)(0) is ^ &ij0ji> Hence 

j 

tr (a) (J3) = X) aifiji 

and this is symmetric in (a) and (0). 

The result we have just noted holds also for the other coeffi- 
cients of the characteristic polynomial. Thus (a)(0) and (0)(a) 
have the same characteristic polynomials. This, too, can be 
verified directly. However, the following indirect proof has some 
elements of interest of its own. 

We shall suppose more generally that (a) is an m X n matrix 
and (0) an n X m matrix with elements in a field. Then multi- 
plication in either order is possible, and the resulting matrices 
(a)(0) and (0)(a) are respectively m X m and n X n matrices. 
Assuming that n > m y we shall show that the characteristic 
polynomial of (0)(a) = \ n ~ m times the characteristic polynomial 
of (a)(0). We assume first that (a) has the following form 

(44) (a) = diag {1, • • •, 1, 0, • • •, 0} 
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Then if 



08 ) = 



$11 012 
021 022 



01m] 
02 





nl 


0n2 


0nm-' 




■jSn 


012 


01m 


(«)(0) = 


0rl 


0r2 


’ • • 0rm 




0 


0 


... 0 




-0 


0 


• • • 0 . 



■011 • • • 


01r 


0 


. . . O' 


021 


02 r 


0 


... 0 


-0nl 


0»r 


0 


... 0. 



( 0 )(*) = 



Hence if ^(X) denotes the characteristic polynomial of 

("011 012 * • * 01r] 



L0rl 0r2 * * ’ 0rrJ 

then the characteristic polynomials of (a)(0) and of (0)(a) are 
respectively \ m ~ r g(K) and X n_r 4 g‘(X). This proves our assertion. 

We suppose now that ( a ) is arbitrary. There exists a matrix 
(/i) in L(<£, m) and a matrix ( v ) in L(<i>, n) such that (ii)(a)(v) 
= (a)i has the form (44). Set (^)“ 1 (0 )(m)~ 1 = (0)i- Then by 
what we have just proved, the characteristic polynomial of 
(0 )i(gOi is X n ~ m times that of (a)i(0)i. On the other hand, 

(«)i(0)i = (mJWWW^Os)^)- 1 = (m)(«)(0)(m)" 1 

is similar to (a)(0) and 

(0)iWi = = W" 1 (0)(«)W 




106 THE THEORY OF A SINGLE LINEAR TRANSFORMATION 



is similar to (/3)(a). Hence (a) (13) and (a)i(0)i have the same 
characteristic polynomials, and (13) (a) and (j8) 1 (oj) i have the same 
characteristic polynomials. The asserted result therefore holds 
for (a) ((3) and (P)(a). This proves the following 

Theorem 16. Let (a) be an m X n matrix and (0) an n X m 
matrix with elements in a field . Then if n > m, the characteris- 
tic polynomial of (I3)(a) is X n-m times the characteristic polyno- 
mial of (a) ((3). 



EXERCISE 

1. (Flanders) Show that the elementary divisors not divisible by X of (a)(/3) 
and (/ 3)(a ) are the same. Also obtain a relation for the elementary divisors 
which are powers of X. 

15. The ring of o-endomorphisms of a cyclic o-module. In 

the remainder of this chapter we shall consider the problem of 
determining the linear transformations that commute with a 
given linear transformation and the generalization of this prob- 
lem to modules. 

If A is a linear transformation in 9? over <£, the totality 33 of 
linear transformations that commute with A is a subalgebra of 
2. If B e 33, then aiB = Ba x and AB = BA. Hence B com- 
mutes with every linear transformation 0 O z + /3uA + I3 2 iA 2 + • • • 
belonging to the subalgebra 21 = $i[A] generated by 1 and A. 
Conversely, if B is any endomorphism in the group 9? that com- 
mutes with every element in 21, then B e33; for aiB = Bai for 
all a so that B is a linear transformation and BA = AB. 

If 9? is regarded as a <£[X]-module as before, then 

x(Poi + iA + • • •) = <t>(\)x 

where <£(X ) = 0 O + 0iX + • • • . Hence, an endomorphism B com- 
mutes with every linear transformation belonging to 21 if and 
only if (4>(\)x)B — <t>(X)(xB). Hence 33 coincides with the set of 
$[X]-endomorphisms of 9£. We are therefore led again to adopt 
the module point of view, and to consider the problem of deter- 
mining the set 33 of o-endomorphisms of any finitely generated 
o-module, o a principal ideal domain. It is evident that 33 is a 
subring of the ring of endomorphisms of the group 9 1. 
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As usual we denote the ring of the endomorphisms ai: x — > ax y 
a in o, by o*. This ring is a homomorphic image of o and so it is 
commutative. It follows that Oz C 33 the ring of o-endomor- 
phisms. It is also clear that Oz is contained in the center of 33. 
It should be remarked that in the special case of the <£[X]-module 
determined by a linear transformation A y the ring <I>[X]z is just 
the ring 21 = $i[A], 

We consider first the problem of determining 33 in the special 
case of a cyclic o-module. It is not necessary to assume here 
that o is a principal ideal domain, but only that o is a commuta- 
tive ring with an identity. We have the following 

Theorem 17. If o is a commutative ring with an identity and 
9t is a cyclic §-module y then the only s-endomorphisms of 9? are the 
mappings x — > ax. 

Proof. Let 9i = {e} and let B be an o-endomorphism of 9L 
Suppose that eB = fie . Then if x = ae y xB = ( ae)B = a(eB) 
= fi{ae) = fix. Thus B = fii. 

Corollary. If A is a cyclic linear transformation in a vector 
space over a field y the only linear transformations that commute with 
A are the polynomials in A ( with coefficients in $z). 

EXERCISE 

1. Show that the matrices that commute with 

0 

1 0 
1 • 

L 1 0. 

are given by 

r « 0 

OLi ao 
Oil 

-Oi n - 1 * * * Oil « 0 J 

where the are arbitrary in <£. 
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16. Determination of the ring of o-endomorphisms of a finitely 
generated o-module, o principal. We suppose now that 91 is a 
finitely generated o-module, o a principal ideal domain. We know 
that 9? = {/i} © {/ 2 } ©•••©{/*} where, if the order ideal Q fi 
= (Si), i = 1, 2, • • •, /, then 

(45) 8j = 0 if j > u and hi | Sj if i < j for all /, j. 

Let B e 33 the ring of o-endomorphisms of 9J, and suppose that 
fiB = gi y i = 1, 2, •••,/. Then if x is any element of 9J, x — 
2&/*, £; in o. Hence 

xB = (2£ifi)B = 2(£ifi)B = 2f»(/ t -2?) = 

Thus B is completely determined by its effect on the generators 
ft of 9 1. We note next that since 5*-/* = 0, higi = Si(/iB ) = 
( hi/i)B = 0. Hence if (e*) is the order ideal of g iy then e* | 5*. 

Conversely, suppose that for each /, gi is an element of 9 i 
whose order ideal (e*) satisfies the condition | 8{. Define B to 
be the mapping 2£*/* — > 2£^. Then we assert that B e33. We 
show first that B is single-valued. For suppose that 2£*/; = 
are two representations of the same element. Then 2 (£*• — 77;) /* 
= 0. Hence 8i | (& — rji). Consequently e* | (£; — 77*), and this 
implies that 2(& — 77 *),§■; = 0, or 2 ^gi = 2 rjigi. This shows that 
the results obtained from the two representations are equal. 
The verification that B is an o-endomorphism is now immediate. 

Our result is the following: There is a 1-1 correspondence be- 
tween the elements B and the ordered sets (g iy g 2 , • • *, gt) of 
elements gi whose order ideals (e*) satisfy the condition e* | 8i. 
We now set gi = 2/Jyy/y, Pij e 0, and we associate with the ordered 
set (gi y g 2 , • • *, g*) the matrix 

P 11 P 12 * * * Pit 

^21 ^22 ' * * $2t 

Ptl Pt2 * * * Ptt- 

in the ring o* of t X / matrices with elements in 0. This matrix 
is not uniquely determined. For any Pij may be replaced by a 
Pi/ such that &/ = j3iy(mod 8 y). This is the only alteration that 
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can be made without changing the g{. Hence we may say that 
the elements of the^-th column of (/3) are determined modulo bj. 
The condition e* | b iy or what is the same thing, bjgi = 0, is 
equivalent to the equations 



(47) 



bifiij = 0 (mod bj). 



This, of course, means that there exist 7 # such that bifaj = jijbj. 
Hence (47) is equivalent to the following condition on the matrix 
(j3) of (46). There exists a matrix ( 7 ) such that 



(48) 



81 



SJ 





'0n 


@\2 


••• 01 


t 




021 


@22 


••• 02 


t 




Ai 


@t2 


* * * fitt- 


Til 


7 12 


• • • 7i* 




721 


722 


• * * 72 1 






7*2 


• * * 7 tt. 





r«i 



The totality 2)1 of matrices (@) which satisfy this condition is a 
subring of the matrix ring o t . The matrix (fi) determines an ele- 
ment B of 33 such that fiB = 2J/3 t -y/y. It is easy to verify that the 
correspondence (j 3) — > B is a homomorphism between 2 ft and S 3 . 
Now the endomorphism B determined by (/3) is 0 if and only if 
Pij = 0 (mod bj). Hence the kernel of our homomorphism is the 
set 21 of matrices (v) in which is a multiple \iijbj. Thus Be 2 t 
if and only if there exists a (m) in o t such that 



(49) « = (m)(8 ) 

where (5) = diag { 5i, 5 2 , • • •, S*}. The ring S3 is isomorphic to 
the difference ring 2 ft/ 2 t. 

Theorem 18 . Let 91 = {/ 1 } © {/ 2 } ® • • • © {/*} where the 
order ideal of fi is (5*). Then the ring S3 of o-endomorphisms of 9t 
is isomorphic to the difference ring 2 ft/ 9 i where 2 ft is the subring of 
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0 * of matrices ( 0 ) for which a ( 7 ) exists such that (48) holds and 9? 
is the ideal of matrices (y) for which a (f) exists such that (49) holds . 

An explicit determination of the matrices of 3D? can be made if 
use is made of the conditions (45) on the 5’s. We note the fol- 
lowing cases of (47) : 

1. / > j. Here 8* = 0(mod 5y). Hence these 0# are arbitrary. 

2 . / < u y j > u. Here (47) and (45) imply that these 0 # = 0 . 

3. i y j > u. Here 8 t - = 8 y = 0 and the 0 ’s are arbitrary. 

4. i <j < u. Let rjij = Then (47) is equivalent to 

the condition 0 # = 0 (mod raj). 

Thus ( 0 ) has the following form: 



(50) 



‘flu 

021 



012 

022 



0w 1 0^ 



0 u + l,l 014 + 1,2 



0£2 



01 u 

02 u 

fiuu 



0W + 1.W 



Ptu 



0W+1.W 



+1 



0< t M 



+ 1 



^14 + l,f 



fitt 



where all the 0 ’s are arbitrary except above the main diagonal in 
the upper left-hand block. Here 0 # = Vijrjij where ^ is arbi- 
trary and rjij = x 5y. The condition 0 *y = 0(mod Sy) for these 
0 ’s is equivalent to Hij = 0 (mod 8 *). 

17. The linear transformations which commute with a given 
linear transformation. We specialize 9 ? to be the <£[A]-module 
determined by the linear transformation A. Here every 5* 0 

so that u = t. The ring 3D? now consists of all the matrices ( 0 ) 
in which the 0 # are arbitrary if i > j and 0 # = Hijriij, Vij = S** 1 5y 
for i < j . Any j 8 # may be replaced by 0 *y' in the same coset 
(mod 5y) . Consequently mo' may be replaced by /z t -y' in the same 
coset (mod 5*). Thus if w* = deg S t * then we may suppose that 

deg 0 *y < »y if / > j 

deg j uu < m if / < j . 




THE THEORY OF A SINGLE LINEAR TRANSFORMATION 111 



A matrix in 9 ft that satisfies these conditions will be called a 
normalized matrix. It is clear that two normalized matrices de- 
termine the same B in the ring S3 of o-endomorphisms if and only 
if they are identical. Hence the correspondence (/3) — > B deter- 
mined before is 1-1 between the set U of normalized matrices and 
the ring S3. 

We know that S3 3 21 = $i[A] 3 It follows that S3 is a 
subspace of the vector space 8 over <3>. We wish now to compute 
the dimensionality of S3 over <£. 

We note first that the normalized matrix corresponding to the 
scalar multiplication on is the scalar matrix al = diag {a, a> • • •, 
a) of t rows and columns; for we have the relations fioci = afi. 
Also it is clear that the set U of normalized matrices is closed 
under addition and under multiplication by scalar matrices. It 
follows that U may be regarded as a vector space over <t>. Here 
a(j3) for (j8) in U is defined to be the matrix al(/3). If B^ i = 1, 2, 
is in S3 and Bi — > (j8<) eU, then clearly B x + B 2 — > (/3i) + (jg 2 ) 
and aBi — a(/3i). Since our correspondence is 1-1, this shows 
that S3 over <I> is equivalent to U over <3>. We now determine the 
dimensionality of U. 

Let U ij denote the subspace of U of normalized matrices in 
which fiki — 0 for all {k y l) 9^ (i,j). It is easy to see that if 
i >h dim U ij y the dimensionality of the space of polynomials of 
degree < nj is nj. Similarly if i < j y dim U# = Since U is a 
direct sum of the subspaces U»y, 

t t — i 

dim U = 2 (/ — j + 1 )ttj + X) if ~ *)«» 

3 = 1 1 = 1 

= ( 2 / — 2 j + 1 )nj. 

3=1 

This proves the following 

Theorem 19. ( Frobenius ) Let ( a ) e$ n and let 5i(X), 5 2 (X), • • •, 
§*(X) be the invariant factors 1 of XI — (a). Then if the degree 
of 5 Z *(X) is ni, the maximum number of linearly independent matrices 
commutative with (a) is given by the formula 

N=£(2t- 2 ; + 1 )m. 

3=1 
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t 

Clearly, if t > 1, then N > n j = n > n t- Since the dimen- 

3=1 

sionality of 21 = &i[A] over is n t > this shows that, in this case, 
33 3 21. If we recall that t = 1 is the condition that A be cyclic, 
we obtain the following converse of the corollary to Theorem 17. 



Corollary. If A is a linear transformation which is not cyclic , 
then there exist linear transformations that commute with A which 
are not polynomials in A . 

Example . Let 0 0*1 

(a) = 0 0 1- 

.0 -1 2 . 



Here, if A is the corresponding linear transformation, then 9? = {/i} © {/ 2 b 
The invariant factors are 81 = X — 1 and 82 = (X — l) 2 . The general form of a 
normalized matrix ( 0 ) is 

rfti 0i 2 (X - i) 1 
tfti P22 + 



Since X/i =/i andX 2 / 2 = (2X - l)/ 2 = ~h + 2 (X/ 2 ), 
f\B = 0 nfi — ^12/2 + / 3 i 2 (X/ 2 ) 

/ 2 5 = ^ 2 l/l + 022 J 2 + p22 f Of 2) 

(X/ 2 )R = ftn/l — ^ 22^2 + (022 + 2^ 22 ')(X/ 2 ) • 

It follows that the general form of a matrix commutative with (a:) is 

'fill — fin fin 

fi2l fi22 022 

~02l — 022 022 + 2022 - 



EXERCISES 

1. Let 9t be a finite commutative group and suppose that 9? is a direct sum of 
cyclic groups of orders n h « 2 , • • • , w* where rii | rij for i < j . Prove that the num- 
ber of elements in the ring of endomorphisms of 9? is 

N = A a,*-*”- 1 . 

?=1 

2. Determine the matrices that commute with 

-0 0 0 0 0 ' 

1 0 0 0 0 

0 0 0 0 0- 

0 0 10 0 
.0 0 0 1 0. 
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3. Determine the matrices which commute with 

■10 0 Oi 

0 0 10 
0 0 0 1 
.0 1 -3 3. 

18. The center of the ring 33. We return to the general case 
where 9t is a finitely generated o-module, o a principal ideal do- 
main. As before, let 9 1 = {/i} © { ^* 2 } ® * * * © {/*} where the 
orders (5*) satisfy (45). We shall prove the following 

Theorem 20. The center of the ring of ^-endomorphisms of dt 
consists of the scalar multiplications . 

Let 33 be the ring of o-endomorphisms, E its center and o* the 
ring of scalar multiplications x — » ax. We have seen that 0 1 C E. 
Now let C be any element of 6. Let E ky k = 1, 2, ••*,/, be the 
o-endomorphism such that fjE k = Sj k f ky j = 1, 2, ••*,/. By the 
considerations of section 16, such endomorphisms exist in 33. 
Also there exists o-endomorphisms E tk such that fjE tk = Sj t f k . 
Since C commutes with these endomorphisms, we have the fol- 
lowing equations 

fiC = C ftE t )C = (. f t C)E t = 7 /„ 7 e <r 

f k C = (f t E tk )C = (ftC)Et k = (7 ft)E tk = 7 (ftE tk ) = 7 U 

Thus C coincides with the mapping x — > yx. Hence S = Oi. 

If C is any endomorphism in 9t commuting with every element 
of 33, then, in particular, C commutes with every element of o*. 
Hence Ce33. Thus C is in the center of 33. The converse is 
clear. This remark enables us to state Theorem 20 in the follow- 
ing alternative form: 

Theorem 20'. The only endomorphisms of 9? which commute 
with every 0 - endomorphism are the scalar multiplications . 

This specializes to the 

Corollary 1. If C is a linear transformation that commutes with 
every linear transformation which commutes with A, then C is a 
polynomial in A. 

This corollary enables us to determine the center of the com- 
plete ring 2 of linear transformations. For let A = 1. Then the 
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ring 33 of linear transformations that commute with A is the com- 
plete ring 8. Hence Corollary 1 states that the only linear trans- 
formations that commute with every linear transformation are 
the polynomials in 1. Since a linear transformation is expressible 
as a polynomial in 1 if and only if it is a scalar multiplication, 
this gives the important 

Corollary 2. The center of the ring of linear transformations of 
a vector space over a field is the set of scalar multiplications . 

A slightly more direct proof of this result will be given later 
(Chapter VIII, p. 229). 



EXERCISES 

1. Prove that the center of the ring of endomorphisms of a finite group con- 
sists of the endomorphisms x — > mx , m an integer. 

2. Prove that a linear transformation A is cyclic if and only if the ring 33 of 
linear transformations commutative with A is commutative. 

3. Prove the following extension of Theorem 20': The only endomorphisms of 
9? which commute with every idempotent O-endomorphism are the scalar mul- 
tiplications. 
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In this chapter we shall introduce some general concepts which 
are fundamental in the study of arbitrary sets of linear transfor- 
mations. A deeper study of these notions belongs more properly 
to the so-called theory of representations of rings and is beyond 
the scope of the present volume. An introduction to these notions 
will serve to put into better perspective the results of the pre- 
ceding chapter. We shall also be able to extend some of these 
results to sets of commutative linear transformations. 

1. Invariant subspaces. For the most part we shall be concerned 
in this chapter with the general case of a vector space over a 
division ring A. Let be a finite dimensional vector space over 
A and let 12 be a set of linear transformations in 9? over A. If 
(tiy e 2 , *• • *, e n ) is a basis for dt and A e 12, then eiA = and 

(a) is the matrix of A relative to the given basis. The matrices 
(a) determined in this way by the A ett constitute a set co that 
we shall call the set of matrices of 12 relative to (e u e 2y • • e n ). If 
C/i>/ 2 > * * * yfn ) is a second basis and /»• = 2/^y, then the matrices 
of 12 relative to this new basis is the set { (ju) (a) (m) _1 } > («) in co. 
We denote this set as (/x)co(ju) -1 . 

From the geometric point of view a fundamental problem in 
the study of a set of linear transformations is that of determining 
the invariant subspaces relative to this set. As in the case of a 
single linear transformation a subspace © is called invariant under 
12 if (BA C B for every A e 12. If 12 consists of a single transfor- 
mation A y then the cyclic subspaces {*} are examples of invariant 
subspaces. Other simple examples are the following: 

1. 12 consists of the linear transformation 0. Here any sub- 
space is invariant since ©0 = 0 C (© for any ©. 
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2. 12 = 2 the complete set of linear transformations. Here the 
only invariant subspaces are the zero space 0 and the whole 
space 9 t. Thus let © be an invariant subspace 7^ 0 and let y 
be a non-zero vector in ©. Then if x is any vector in 9?, there 
exists a linear transformation A such that yA = at. Since © is 
invariant x = yA e ®A C ©. Hence # e © and since x is arbi- 
trary © = $R. 

Obviously the whole space SR is an invariant subspace relative 
to any set 12 . Also the 0 space is invariant since 0 A = 0 for any 
linear transformation A. The second example above shows that 
there exist sets 12 for which these two “trivial” subspaces are the 
only invariant subspaces. Such a set is called an irreducible set. 
It is also convenient at times to say that dt is irreducible relative 
to the set 0 . 

Reducibility, or the existence of a proper (5* 0 , 5*^ 9?) invariant 
subspace manifests itself as a simple condition on the sets of 
matrices of 12 . Suppose © is a proper invariant subspace and let 
(fiyfz, * • -,/n) be a basis for 9 ? such that (/i,/ 2 , • • *,/ r ) is a basis 
for ©. Since © is invariant, /{A e © for each / = 1 , 2, • • • , r 
and each A e 12 . Hence the relations that give the matrix A 
relative to (/1 ,/ 2 , • • - ,/ w ) are 

M= Zfti/y, /= 1,2, ..-,r 

a) 

n 

fkA = X £ = r + 1, r + 2, •••,». 

f=i 

Hence the matrix (/3) of has the form 



'/3u 


@12 


• • • 01r 


0 


0 


... 0 


@21 


@22 


02 r 


0 


0 


... 0 


@n 


@r2 


• • ' 0rr 


0 


0 


... 0 


@r + l,l 


@r + 1,2 


0r + l,r 


0r+l,r + l 




0r+l,?i 


~@nl 


@n2 




0n,r + 1 




• • • 0n,» - 



( 2 ) 
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A matrix that has an r X n — r block of 0’s in its upper right- 
hand corner is said to have reduced form. Hence the existence of 
a proper invariant subspace implies the existence of a basis rela- 
tive to which the matrices of Q, all have reduced form. Another 
way of putting this is that, if is the set of matrices of £2 relative 
to some basis (*i, e 2 , • • *, e n )> then the existence of a proper in- 
variant subspace implies that there exists a non-singular matrix 
(n) such that all the matrices of (/x)co(m) - 1 have the same reduced 
form (that is, with the same r for all). 

The converse is valid, too. For suppose that all of the ma- 
trices of (^t)co(/x) 1 have the reduced form (2). Then, if /*• = 
the matrices of £2 relative to C/ 1 ,/ 2 , * * - ,/ n ) constitute the 
set (m)co(/x) _ 1 * Because of the form of these matrices, relations 
(1) hold, and these show that the space © = [fu/ 2 , • • - ,/ r ] is 
invariant under 12. 

EXERCISES 

1. Let 0 be a set of linear transformations, and let B be a linear transformation 
that commutes with every A e 12. Show that if © is an invariant subspace rela- 
tive to 12, then ©5 is also invariant. Also, show that the subset of vectors 
y e © such that yB = 0 is invariant relative to 12. 

2. Prove that a subspace © is invariant relative to a set 12 if and only if the 
following operator condition holds: The relation EAE — EA holds for every 
projection E onto © and every A 8 12. 

2. Induced linear transformations. If © is a subspace invar- 
iant under 12, then the linear transformations A e $2 induce trans- 
formations in ©. It is evident that these transformations are 
linear. We shall now show that in a certain sense the A ei 2 also 
induce linear transformations in the factor space 3{ = 9?/©. We 
recall that a vector x of S is a coset consisting of the vectors of 
the form x + y where x is fixed and y ranges over ©. If A s 12, 
(x + y)A = xA + yA = xA + y' where y f = yA e ©. Hence 
the image of any vector in the coset x is a vector in the coset 
xA determined by xA. Thus the mapping A that associates 
with x the vector xA of is single-valued. We call A the trans- 
formation induced by A in This mapping is linear, since 

(xi + x 2 )A = ( xi + x 2 )A = ((#1 + x 2 )A) = (jx \A + x 2 A) 

= (xi A) + (x 2 A) = Xi A + x 2 A 
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and 

{ax) A = {ax) A = {{ax) A) = {a{xA)) = a{xA) = a(xA). 

When there is no risk of confusion, we shall simplify matters by 
denoting the induced transformation by A also. 

On the other hand, it is sometimes necessary to distinguish 
carefully between the transformations A and A. The precise re- 
lation between these can be made explicit in the following way: 

Consider the mapping P: x x = x + <& of 9t onto It 
is immediate from the definitions of the compositions in 91 that 
P is a linear transformation of 9? onto We shall refer to P as 
the natural mapping of 9t onto Now we have defined A by 
the rule that xA = xA. Thus we have the relation PA = AP 
connecting A and A. 

Now suppose the basis (/i,/2, * • * 3 /n) is chosen as in the pre- 
ceding section so that (/i 3 / 2 3 * • - ,/ r ) is a basis for ©. Then it 
is clear from the first set of relations in (1) that the matrix 



'flu 


@12 


@lr 


@21 


@22 


@2 r 


-@rl 


@r2 


• • @rr - 



is a matrix of the linear transformation induced by A in the in- 
variant subspace ©. Moreover, by the second set of equations 
in (1) we have 

JkA = (JkA) = f^2@ki/i) = ^2@kifi = 'jLt&kifb 

\ 1 / 1 r+ 1 

Now we know that the vectors (/ r+ i,/ r+2 > • • •,/») form a basis 
for Hence these relations show that the matrix of A relative 
to this basis is the lower diagonal block which appears in (2): 

@r-\-l,r + l ’ @r + l,n 

@r+2,r + l * * * @r+2,n 

-@n,r + l * * * @nn 
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We have now found interpretations for the two diagonal blocks 
which occur in (2). It is natural to ask for the meaning of the 
remaining block, appearing in the lower left-hand corner. To ob- 
tain the significance of this block we observe first that [ f r +i , • • - , 
/„] is a complement U of the space ©. The decomposition 
9? = ©®U defines a projection E of 9? onto ©. Since © is in- 
variant relative to £2, EAE = EA holds for every A e fi (Ex. 2, 
p. 117). We consider now the linear transformation AE — EA 
= A E — EAE. This transformation sends (ft into ©. More- 
over, if y e ©, then y(AE — EA) = yA — yA = 0. It follows 
from this that AE — EA defines a linear transformation A E of 
(ft = (ft/© into ©. Thus we define 

xAe = x{AE — EA), 

and, by the remark that we have made, we see that A E is single- 
valued. One verifies directly that A E is a linear transformation 
of (ft/© into ©. Moreover, by definition we have the relation 
AE - EA = PA e . 

We shall now show that the matrix 

f^r+1,1 * ' ’ ^r+l.rj 



' ' ‘ ftn,r -* 

is the matrix of A E relative to the bases (/ r + 1, • • •,/«), (/i, • * 
f r ) for (ft and @. We have the relations 

JkA E = f k {AE - EA) = f k AE 

r 

= ^ fikif i) k = T \ y * y 
i= 1 

which prove our assertion. 



EXERCISES 

1. Prove that a relation such as A + B = C or A B = C for A, B, C in 12 im- 

plies a corresponding relation A + B = C, AB = C for the induced linear trans- 
formation in 9?. _ . — 

2. Show that A is 1-1 in 9J if and only if A is 1-1 in © and A is 1-1 in 

= SR/@. 
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3 . Composition series. Let © and U be invariant subspaces 
such that 9t 3 U ^ ©. Then U = U/© is a subspace of = 
9?/©. If £ etl, & = u + © where us U. Hence for any A in 12, 
uA = uA eU. This shows that U is invariant relative to the set 
12 of the induced linear transformations A in 9?. 

We shall now show that the converse holds, namely, any in- 
variant subspace of has the form U = U/© where U is an in- 
variant subspace of 9 i containing ©. Thus let U be a subspace of 
invariant relative to 12. Let U be the totality of vectors con- 
tained in the cosets that belong to U. If Ui and u 2 ell, = U\ 
+ © and u 2 = u 2 + © are in U. Hence iii + u 2 = {u x + u 2 ) 
+ © is in U. Hence U\ + u 2 e U. Similarly au s U for any a in 
A and any u in U. Since (u + < 3 )A = uA + © e U, uA s U. 
Thus U is an invariant subspace of 9 1 . Clearly U = U/©. 

A sequence of invariant subspaces 

(3) 0 c ©! c © 2 c • • • c ©« = dt 

is called a composition series for 9 1 relative to 12 if each © t - is irre- 
ducible over ©;_i in the sense that there exists no invariant ©' 
such that @ t O ©'3 ©*•__!. By what we have shown it is clear 
that ©»• is irreducible over © t -_i if and only if ©*•/©*_! is irre- 
ducible relative to the set of linear transformations induced by 
the A s 12. The irreducible spaces 

(4) ®i,© 2 /©i, •••, ©*/©,-! 

are called the composition factors of the series (3). 

We now choose a basis (/i ,/ 2 , • • -,/ ni ) for ©i. This can be 
supplemented to a basis (/i,/ 2 , ■ • *,/ ni+n2 ) for © 2 . Continuing 
in this way we obtain finally a basis (/i,/ 2 , • • *,/ n ) for 9 i such 
that (/i,/2, * • •,/ ni+ ... +ni ) is a basis for ©*. Then if A s 12 

m 

fiA = 22 Pijfjy i — I5 2 , • * *, 77 1 

1 

ni + ri2 

( 5 ) f k A = X) 0 mA £ = »j + 1 , •••,»! + » 2 



wiH 1 ~n t 

fpA = X) Ppqfq, P ~ n \ + ‘ ' - + Kt— 1 + lj • • 

1 »1 +•••+»« = » 
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Consequently the matrix of A relative to our basis has the form 



( 6 ) 



’ 03 x) 0 

* (fo) 



••• G 80 J 



in which the “blocks” above the main diagonal are all 0 . The 
cosets (,/ni-f- • • • + n i_i+i> fni+ • • • +n *> form a basis 

for the factor space ©,•/©,•_ j, and by ( 5 ) we have 

__ ni+***+ni 

JrA = E /W« 

Wl+ • * • 1 + 1 

if A denotes the transformation induced by A in the factor space 
©*/©*•_ i. The matrix of A relative to this basis is therefore the 
diagonal block (ft) that appears in ( 6 ). Our assumption that 
©* is irreducible over ©*_i means that it is impossible to find a 
matrix (/z») such that all of the matrices (/**) (ft) (m») ~ 1 all have 
the same reduced form. 

It is easy to prove the existence of a composition series for any 
set 0. First, if SR is irreducible, then 0 ci SR is such a series. 
Otherwise let © be a proper invariant subspace. If © is irre- 
ducible relative to the set of induced linear transformations, we 
take ©i = ©. Otherwise we let ©' be a proper invariant sub- 
space of ©. Now dim SR > dim © > dim ©'. Hence this proc- 
ess cannot be continued indefinitely. Eventually we get an irre- 
ducible invariant subspace ©i ^ 0. We repeat this argument 
with SR = SR/©i and the set 0 of induced transformations. Then 
we see that, if SR ^ 0 (that is, SR ^ ©i), SR contains an irreducible 
invariant subspace © 2 9^ 0. This space has the form © 2 /©i, and 
© 2 is invariant relative to 12 and is irreducible over © x . We con- 
sider next SR/© 2 . If this space is ^ 0, we obtain in the same way 
a subspace ©3 invariant under 12 and irreducible over © 2 . Since 
dim ©1 < dim © 2 < dim © 3 < • • *, this process, too, breaks off 
after, say, t steps, with the space SR. Then 0 c © x c © 2 c ••• 
c ©f = SR is a composition series. 




122 



SETS OF LINEAR TRANSFORMATIONS 



4. Decomposability of a set of linear transformations. We 

consider now a decomposition of the space 9? as 

(7) 9J = 9?i ® 9?2 © * * * © 

a direct sum of the subspaces 9? z - that are assumed to be invariant 
relative to 12. Such a decomposition is called proper if each 
dti 0 and s > 1. If 9? has a proper decomposition, we say that 
9? is decomposable relative to 12 and that 12 is a decomposable set of 
linear transformations. 

Clearly, a set of linear transformations which is irreducible is 
indecomposable. On the other hand, there exist reducible sets 
which are indecomposable. Hence decomposability is an essen- 
tially stronger condition than reducibility. We prove this asser- 
tion by citing the following 

Example. Let 12 consist of the linear transformation A with matrix 

-o 

1 0 
1 • 

1 0 . 

relative to the basis (ei, e 2 , • • •, e n ). Then e± A = 0 and eiA = ei-\ if i > 1. 
Hence the subspaces ©»• = [e\> £ 2 , • • •, *i], / = 1, 2, are invariant. We 

shall show that these are the only non-zero invariant subspaces relative to A. 
For let © be such a subspace. Let h be the smallest integer such that © Cl © A . 
Then © contains a vector 

7 i*i + 72*2 H b 7 heh 

with 7 h 0. We may assume that yu = 1 so that 

y = 71*1 H b yh-ieh-1 + eh 8 ©. 

Then the vectors 

y A = 72*1 H — • + yh-ieh-2 + eh- 1 
yA 2 = 73*1 H b 7 *-i*a -3 + e h -2 



are in ©. Evidently ei, e 2 , • • • , ^ are linearly dependent on these vectors. Hence 
©/> C © and so © = ©*. Since ©; CZ ©y if / < j> it is clear that no two of 
these spaces are independent. Hence 9i cannot be written as a direct sum of 
these invariant subspaces. 
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If 9? is a direct sum of the invariant subspaces 9t*, i = 1, 2, • • •, 
s, s > 1, we may choose a basis (/i,/ 2 > * * •,/») for 9? such that 



( 8 ) 



C /" ni + • 



i + l3 ./* ni + • * • +Wt— 1+23 



3 »/wi+ • • • + n{) 



is a basis for 3?/. Since 9?; is invariant, any vector in (8) is trans- 
formed into a linear combination of these vectors by any A eft. 
Hence the matrix of A has the form 



( 9 ) 



w 

(ftO 



L G8.)J 

Here the diagonal block is the matrix of the transformations in- 
duced by A in 9^- relative to the basis (8). Conversely, if 0 is 
any set of linear transformations and there exists a basis C/ 1 ,/ 2 , 
• * -3 fn) such that all the matrices relative to this basis have the 
form (9), then 91 is a direct sum of the invariant subspaces 

L/niH hni_i+l3 * * ' )fn\-\ l-nj* 

Suppose now that £*, / = 1, 2, • • *, j, are the projections de- 
termined by our decomposition. We recall that, if 



(10) x = Xx + x 2 -\ h 

where x { e 9 U, then Ei is the mapping x — > X{. We have also 
seen (p. 60) that the following relations hold: 

(11) Ei 2 = E iy EiEj = 0, E x + E 2 +•••+£, = 1. 

The space 9?; = 9tE*. Hence each Ei is ^ 0. Now let A ei 2. 
Then 

xA = Xx A -f" x 2 A T“ • • • “b x s A 



and, since 9?; is invariant, X{A e 9 l». Thus the component in 9^- 
of xA is XiA y or, 

(12) xAEi = X{A = xE{A. 

This shows that the projections Ei commute with every A e 0. 
Conversely suppose the Ei are linear transformations 9 ^ 0 which 
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satisfy (11) and which commute with every A e 12 . Then we 
know that 92 = 92 1 © 92 2 © * • • © 9?* where 92; = 9 lEi. More- 
over, if A e 12 and Xi e 92;, then X{ = xEi and XiA = xEiA = 
(, xA)Ei e 9 2;. Hence 92; is invariant relative to 12 . 

If j > 1 in our discussion, each 9 2; is a proper subspace and 
each Ei ^ 1. Hence we see that, if 12 is a decomposable set of 
linear transformations, then there exist projections ^ 0 , ^ 1 
which commute with every A in 12 . Conversely, if E\ is a pro- 
jection 7 ^ 0 , 7 ^ 1 which commutes with every A in 12 , then E 2 
= 1 — Ei has these properties also. Moreover, Ei and E 2 are 
orthogonal. Hence 9 2 = 92£i © 9 lE 2 and the 92£; are proper in- 
variant subspaces. This proves the following important criterion. 

Theorem 1. A set 12 of linear transformations is decomposable 
if and only if there exist projections E ^ 0, ^ 1 which commute 
with every transformation in 12. 

EXERCISE 

1. Show that, if 92 is irreducible (indecomposable) relative to a subset of 12, 
then it is irreducible (indecomposable) relative to 12. Use this to prove that the 
set of linear transformations corresponding to the set of triangular matrices 

m an 0 - 

«21 Ot22 



L«nl Ot n 2 a nn-* 

is an indecomposable set. 

5. Complete reducibility. If ©i and © 2 are invariant sub- 
spaces under 12, then so is ©i fl © 2 and ©i + © 2 . Hence the 
totality of invariant subspaces is a sublattice L a of the complete 
lattice L of subspaces of 92. It is natural to apply lattice-theo- 
retic ideas in the study of the set L n and this is, in fact, what we 
have done in the foregoing discussion. Thus the statement that 
12 is irreducible amounts to saying that L Q contains just two ele- 
ments. Also the statement that 12 is decomposable may be for- 
mulated as a property of the lattice Z, Q . Of the properties of L 
we singled out in Chapter I, it is clear that the chain conditions 
and the Dedekind law are preserved in passing to the sublattice 
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L q . The complementation property of L y however, may or may 
not be valid in It certainly holds if 12 consists of the identity 
1 only, for then L Q = L. Also, if 12 is irreducible, then L a is com- 
plemented, and there are other less trivial examples which will be 
encountered later. 

If L q is complemented, we say that 12 is a completely reducible 
set of linear transformations. This, of course, means that, if 
© is any invariant subspace relative to 12, then there exists a 
second invariant subspace ©' such that 9{ = © © ©'. This con- 
dition is admittedly rather elusive since it applies to every in- 
variant subspace ©. It is therefore remarkable that this (pos- 
sibly infinite) set of conditions can be replaced by the single con- 
dition given in the following 

Theorem 2. A set 12 of linear transformations is completely re- 
ducible if and only if 9t can be expressed as a direct sum of sub- 
spaces 9L* that are invariant and irreducible relative to 12. 

Sufficiency . Let dt = 9L © 9J 2 © * * * © 9t s where the 9t; are ir- 
reducible invariant subspaces. If © is any invariant subspace, 
either © = 9? or there exists an 9 l iy say 9?i, such that 9fi ©. 
Then we set ©i = © + 9?i. Now © fl 9?i e L Q and since © fl 9ii 
is contained in the irreducible invariant subspace 9ti, either 
© fl 9ii = 9ii or © fl 9ti = 0. Since © fl 9ii = 9? i is equivalent 
to © 3 9 ii, we must have © fl 9?i = 0. Hence ©i = © + 9?i = 
©®9ii. We now repeat the argument with © x in place of ©. 
Then either © x = 9?, in which case 9ii is a complement of ©, or 
there exists an 9 h y say 9 i 2 , such that © 2 = ©i + 9i 2 = ©i © 9? 2 . 
Then © 2 = © © 9?i © 9J 2 . Eventually we obtain for a suitable 
choice of the notation that 9i = © © 9?i © 9? 2 © • • • © 9 1*- Then 
©' = 9?i © 9t 2 © • • • © dth is a complement of © in 9 L 

Necessity . Suppose that 9?i is an irreducible invariant subspace 
of 9t Then either 9i = 9ii is irreducible or 92 = 9?i © 9V where 
Wf is an invariant subspace ^ 0. Next let 9f 2 be an irreducible 
invariant subspace of 9fi'. If 9?i' = 9t 2 , we have 9? = 9?i © 9? 2 , 
9f i irreducible invariant as required. Otherwise, 9fi + 9 l 2 has a 
complement dl 2 ' y 9J 2 ' invariant. Then 9? = 9Ji © 9J 2 © 9? 2 '. We 
repeat the argument with 9 If- This leads finally to 9t = 9?i © 
dt 2 © * • * © 9i s where the 9J* are invariant and irreducible. 
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EXERCISES 

1 . Use the argument in the first part of Theorem 2 to show that, if 9t is a sum 
(not necessarily direct) of the irreducible invariant subspaces 9 J;, then 9 ? is 
completely reducible relative to 12 . 

2. Show that, if oo is any set of diagonal matrices, then the corresponding set 
£2 of linear transformations is completely reducible. 

3. Let G = { *S* i, 6 * 2 , • • •, 6 *™} be a finite group of linear transformations, that 
is, a subgroup of the group of 1-1 linear transformations of 9? over A. Assume 
that the order m of G is not divisible by the characteristic of A. Show that, if 
© is an invariant subspace relative to G and E is a projection onto ©, then 

1 / m \ 

Eq = — (2 SiESi~ x ) is a projection on © which commutes with the S{. Hence 

w \ t = i / 

prove the important theorem: 

Any finite group of linear transformations whose order is not divisible by the 
characteristic of the division ring is completely reducible. 

4. Let 12 be an arbitrary set of linear transformations and let E be any projec- 
tion such that EA = EAE holds for every A in 12. As in § 2 define the linear 
transformation Ae of 9t = 9V9i E into © = 9 IE by xA_e = x(AE — EA). 
Prove that, if there exists a linear transformation D of into © such that 
Ae = AD — DA holds for all A , then © has a complementary invariant sub- 
space. 

5. Prove that if 12 is completely reducible, and © is an invariant subspace, 
then the set of linear transformations induced in © is completely reducible. 

*6. Relation to the theory of operator groups and the theory 
of modules. The theory of sets of linear transformations that 
we are considering here can be regarded as a specialization of 
the theory of groups with operators (M-groups).* A reader who 
is familiar with the latter concept will observe that we are deal- 
ing here with the additive group 9 ? considered as a group with 
operator set M = A* U 12, A* the set of scalar multiplications. 
The concept of M-subgroup evidently coincides with that of in- 
variant subspace relative to the set of linear transformations 12. 
Hence the concepts of reducibility, decomposability, composition 
series coincide with these concepts for the M-group 9?. 

The theory of M-groups also suggests the introduction of the 
following concept of homomorphism between invariant subspaces 
or factor spaces relative to the set 12. A mapping 6 is said to be 
an 12 -linear transformation of one such space into a second one if 
6 is a linear transformation and (xA)6 = (xd)A holds for all the 
induced transformations A e 12. Similarly we say that two sub- 

* See Volume I, Chapters V and VI. 
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spaces 9 l*, * = 1, 2, are 0 -isomorphic or equivalent if there exists 
a 1-1 12-linear transformation 6 of 9ti onto dt 2 . If (e ly e 2y • • *, e r ) 
is a basis for 9?i, then (ei 8 y e 2 0 y • • *, e r 8) is a basis for dt 2y and it 
is immediately verifiable that every A has the same matrix rela- 
tive to (e ly • • •, e r ) and to (erf, • • *, * r 0). It follows that for arbi- 
trary choice of bases in 9?i and 9^2 the matrices («i) and (a 2 ) of 
any A etl are related by (a 2 ) = (m)(^i)(m) _ 1 where the non- 
singular matrix (/*) is independent of A. 

We can now state the following two fundamental theorems 
which are taken from the theory of M-groups. 

Jordan-Holder theorem. 7/0 c 9ii <= 9^2 ^ * * * c: SR* = 9? and 

0 a ©! cz ©2 ci • • ■ c © v = 9t are two composition series for a set 
ft of linear transformations y then t — u and the factors of composition 
dti/dti-i y @y/©y_ i can be put into 1-1 correspondence in such a 
way that corresponding pairs are £l-isomorphic. 

Krull-Schmidt theorem. If 9t = 9?i ® 9? 2 ® * * * © = ©i © 

© 2 ® * • * © ©fc are two decompositions of 9? into non-zero invariant 
and indecomposable subspaces relative to ft, then h — k and y if the 
@* are suitably ordered y 9J» and ©*■ are ^-isomorphic. 

We refer the reader to Volume I, Chapter V, for the proofs of 
these theorems. 

We can also absorb the present theory into the theory of mod- 
ules. This comes about from the fact that 9 1 is commutative. 
Hence the endomorphisms of 9t form a ring and the set M = 
Ai U ft generates a subring o (M) of the ring of endomorphisms of 
9 1. We call o = o(M) the enveloping ring of M. Now if © is a 
subgroup of the additive group 9 l y then the set of endomorphisms 
of 9 1 that map © into itself is a subring of the ring of endomor- 
phisms. Hence if © is an ft-subspace, this subring contains M 
and consequently it also contains o (M). Thus we see that any 
ft-subspace of 9J is an o(M)-subgroup. The converse is, of course, 
clear. In a similar manner we see that, if 0 is an ft-linear trans- 
formation, then 0 is an o(M)-homomorphism, and generally speak- 
ing nothing is changed in shifting from the set M to its envelop- 
ing ring. We have seen that in dealing with a ring of endomor- 
phisms it is often convenient to regard the underlying group as 
a module. 
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It is usually difficult to obtain the structure of the enveloping 
ring o(M) and, when this is the case, there is no particular gain 
in shifting from the point of view of sets of linear transformations 
to that of modules. If A = $ is a field, the problem of determin- 
ing the structure of o is easier than in the general case. Here we 
note also that <E>* is contained in the ring of linear transformations 
and o can be regarded as an algebra over <£. Consequently in 
this case one speaks of the enveloping algebra of the set £ 2 . 

7. Reducibility, decomposability, complete reducibility for a 
single linear transformation. We adopt again the original point 
of view and we consider in this section the special case in which 
£2 consists of a single linear transformation and A = <£ is a field. 

If / is a non-zero vector in SR, then the cyclic subspace {/} is 
an invariant subspace ^ 0. Let /x/(X) be the order of / and sup- 
pose that fif( X) = 7 r(X)^(X) where i r(X) is irreducible and has posi- 
tive degree and leading coefficient = 1. Then g = fv(A) has 
order i r(\). Thus SR contains a vector whose order is a prime. 
Suppose now that A is irreducible. In this case we have SR = 
where /x/X) = i r(\) a prime. Conversely suppose that SR has 
this form and let © be a subspace 7 ^ 0 invariant under A . If h 
is a vector in ©, the order ma(X) is a factor of 7 r(\). Hence if 
h 0, jJL h (k) = 7 r(X). Hence dim {h} = deg 7 r(X) = dim {^}. It 
follows that {h} = We have therefore proved the following 

Theorem 3. A linear transformation A in SR overQ is irreducible 
if and only if it is cyclic and has prime minimum polynomial . 

We consider next the question of decomposability of a single 
linear transformation. We know that SR = {/ 1 } © 5 / 2 } © * * • © 
{ft}- Hence a necessary condition for indecomposability is that 
t = 1, that is, 9? = {/} is cyclic. We have also seen that, if the 
minimum polynomial of a cyclic linear transformation can be 
expressed as mi(X)m 2 (X) where (mi(X), M 2 (X)) = 1, then SR = {^i} 
© {^ 2 } where M* t (X) = /*»(X). Hence a cyclic linear transforma- 
tion is decomposable unless its minimum polynomial has the 
form 7 r(X) fc , 7 r(X) prime. Conversely, these conditions are suffi- 
cient; for suppose that A is cyclic with 7r(\) k as its minimum 
polynomial. Assume that SR = SRi © SR 2 where SR* is invariant 
under A . Let ju/X) be the minimum polynomial of the transfer- 
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mation induced by A in 9J». Then since Xiw(A) k = 0 for all x { 
in %ti> | Tr(\) k . Hence /z*-(X) = ^(X)* 4 , ki < k. Now xcir(A) ki 
= 0 for all Xi in dl iy and this implies that, if k = max(k ly k 2 ) y 
then w(A)* = 0. Thus k = k and so we may suppose that k\ 
= k . This means that the minimum polynomial of A in 9ti is 
7r(X)*. Consequently there exists a vector f x in 9fi whose order 
is7r(X) fc . Then dim {/ x } = deg7r(X)* = dim 9t. Hence {/i} = 9 X 
and since {/i} £ 9?i, 9?i = 9?. Thus the decomposition 9? = 9ii 
© 9?2 is not proper. 

Theorem 4. A linear transformation A in 9? over <£ is inde- 
composable if and only if it is cyclic and its minimum polynomial 
is a power of a prime . 

We suppose next that A is completely reducible. Then 9? = 
91 1 © 9?2 ® * • • © 9t s where the 9?» are invariant and irreducible. 
By Theorem 3 the minimum polynomial of A in 9i» is a prime 
7T i (X) . It follows that the minimum polynomial ju(X) of A in dt 
is the least common multiple of the 7r»(X). Hence /z(X) is a product 
of distinct primes. Conversely, let A be a linear transformation 
whose minimum polynomial ju(X) is a product of distinct primes. 
Then each invariant factor has this form too. Hence the ele- 
mentary divisor ideals are of the form ( 7 r»(X)), 7 r*(X) a prime. 
Now, we know that this implies that 9J is a direct sum of cyclic 
subspaces 9 1» = where the order n gi (\) = 7r*(X). By Theo- 
rem 3 each 9 is irreducible. Hence 9? is completely reducible. 
This proves the following 

Theorem 5. A linear transformation A in 91 over 4> is completely 
reducible if and only if its minimum polynomial is a product of 
distinct primes. 

EXERCISES 

1. Let A be cyclic with minimum polynomial /x(X). Show that, if © is in- 
variant under A , then © is a cyclic subspace. Show that the invariant subspaces 
of 9? can be put into 1-1 correspondence with the factors having leading co- 
efficient 1 of /z(\). 

2. Let 4> be infinite and let 9? = {/ 1 } ® {^ 2 } be a decomposition of 9? relative 
to A such that M/i(X) = VffS) = 7 r(X) a prime. Show that 9? has an infinite 
number of subspaces invariant under A. 

3. Prove that, if is infinite, then the number of invariant subspaces relative 
to A is finite if and only if A is cyclic. 
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8. The primary components of a space relative to a linear 
transformation. The decomposition of a vector space into inde- 
composable subspaces relative to a linear transformation is not 
uniquely determined. On the other hand, as we shall show in 
this section, there is a decomposition which is not as refined as 
the one into indecomposable components but which has the im- 
portant advantage of uniqueness. 

Let ju(X) be the minimum polynomial of A and let 

(13) m(X) = 7T 1 (X) fc V 2 (X) fc2 • • • 7T S (X)*' 

be the factorization of ju(X) into powers of distinct irreducible 
polynomials having leading coefficients equal 1. Let dti be the 
subspace of vectors Xi e 9? such that 

(14) Xi7Ti(A) kt = 0. 

Then we assert that the spaces 9?* are invariant relative to A and 
that 

(15) 9? = 9ii © 9i 2 © • • • © 9t s . 

The invariance of the 9 i» is an immediate consequence of the fact 
that 7 Vi(A) ki commutes with A (cf. Ex. 1, p. 117). The proof of 

(15) can be obtained by using the results of the preceding sec- 
tion; however, we shall give an independent discussion that has 
some points of interest of its own. 

We note first that the polynomials 

(16) Mi(X) = n(\) / TTi(\) ki = 

^(X)* 1 • • • x,_ 1 (X)^V, +1 (X) ft<+1 • • • t.(X)% i = 1, 2, • • j 

are relatively prime. Hence there exist polynomials $i(X), 
<p 2 (X), • ■ - , <£a (X) such that 

(17) $i(X)mi(X) + $2(X)m2(X) + • • • + 0«(X)ms(X) = 1. 

We can substitute A in this relation and obtain 

(18) <f>i{A) ni(A) + <t> 2 (A)n 2 (A) + • • • + <i> a {A)^{A) = 1. 

Since ju»(X)juy(X) is divisible by jtt(X) if i 9^ j, 

<t>i(A)m{A)<}>j{A)nj(A) = 0. 




SETS OF LINEAR TRANSFORMATIONS 



131 



We set Ei — <f>i(A)m(A). Then this relation reads 

(19) EiEj — 0, i 9* j 
and (18) becomes 

(20) +E 2 +•••+£, = 1. 

If we multiply this relation by Ei and use (19), we obtain 

(21) Ei 2 = Ei. 

Thus the Ei are orthogonal projections with sum 1 and therefore 
91 = 9l£i © 9 IE 2 © • • • © 9 IE S . 

Since the Ei are polynomials in A, they commute with A ; hence 
the spaces 9J-E, are invariant relative to A. We shall now show 
that 9tE,- coincides with the space 9 ii of vectors at such that 
XiWi(A) ki = 0. First let y,- e 9tEi. Theny,- = uEi = u<t>i(A)iJ.i(A). 
Since p,(A)ir i(X) ki = ju(X), this gives 

y i 'K i {A) kx = u<j>i{A)iii(A)Tri(A) k ' = u<!>i(A)n(A) = 0. 

Hence e 9?*. Conversely, let 

X = X X + H h tf s , 

where xj e 9tEj, be a vector in 9t*. Then 

0 = Xi = XiWi(A) kt + X27Ti(A) kt H b X s TTi(A) kl 

and since XjT^A) 1 * e 9J£y each 

= 0 . 

On the other hand, XjTj(A) ki = 0, and since 7r*(X), 7ry(X) are dis- 
tinct if / j y this implies that ary = 0 for j ^ *. Hence x = Xi e 
9tE». This completes the proof that 91* = 9 t£*. 

It is clear that the minimum polynomial of A acting in 9 X% is 
of the form i r t -(X)^, /* < Moreover, if x = x x + x 2 + * * * + x s 
where Xi e 9 i*, and m*(X) = ?ri(X) z V 2 (X) 12 • • • x s (X) z *, then xm*(A) 
= 0 for all /. Hence xn*(A) = 0 and v*(A) = 0. Consequently 
m(X) I M*00- Evidently this implies that /i(X) = ju*(X) and that 
U — ki for / = 1, 2, • • *, s. This completes the proof of the 
following 
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Theorem 6. Let p(A) be the minimum polynomial of the linear 
transformation A in 9 ? over <£ and let (13) be the factorization of 
ju(X) into prime powers. Then , if 9i; is defined to be the subspace of 
vectors Xi such that XiWi(A) ki = 0, 

dt = 3ti ® © • • • ® 9t. 

and the minimum polynomial of the induced transformation in 9t* 
is 7 rf\) k \ 

We shall call the spaces 9 1; the primary components of 9? rela- 
tive to A. The projections Ei determined by the decomposition 
(15) will be called the principal idempotent elements of A. 

We specialize now by assuming that $ is algebraically closed. 
In this case the 7 r*(X) are of first degree, say, 7 r*(\) = X — p*. 
We set 

Ni = (A - Pi l)Ei 

so that 

(22) A = AEi + AE 2 + • • • + AE S = ( piEi + Ni) 

+ (p 2 E 2 + N 2 ) + • • • + ( p s E s + N s ). 

Evidently 

(23) NM = 0, EiNj = 0 = N 3 E { 
if / 7 ^ j and 

(24) EiNi = Ni = N { Ei . 

Also Ni ki = (A — pi\) ki Ei = EfA — pil) ki and if x is any vector 
then xEi e 9 1*-. Furthermore xfA — pi\) ki = 0. This proves that 

(25) Ni ki = 0. 

The nilpotent linear transformations Ni will be called the prin- 
cipal nilpotent elements of A. Like the Ei these linear transfor- 
mations are polynomials in A. 



EXERCISE 



1. Let 4> be algebraically closed of characteristic 0 and suppose that E Ni y i 
= 1, 2, • • •, s y are the principal idempotent and principal nilpotent elements of 
A and that A — 2( p t £; + Ni). Show that, if <j>(X) is a polynomial, then 



4>(J) = S [foi)Et + Ni + N * 



+ ... + < t >(ki V(pi) N*‘- 

+ + (ki- 1)! * 



1! 



2 ! 
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9. Sets of commutative linear transformations. We suppose 
now that £2 is a set of commutative linear transformations in 9? 
over <$. Assume first that £2 is indecomposable. Then we shall 
show that the minimum polynomial of any A e £2 is a prime power. 
For otherwise we can obtain a proper decomposition of 9J as 
9?i © 9?2 © * * * © where the 9?* are the primary components 
relative to A . Now it is clear that if B is a linear transformation 
that commutes with A then 9 UB c 9 1*. Hence the 91* are in- 
variant relative to £2, and this contradicts the indecomposability 
of £2. 

Assume next that 9? is irreducible. Then we assert that the 
minimum polynomial ju(X) of every A is irreducible. For if 7r(X) 
is an irreducible factor of /z(X), then the space © of vectors y such 
that yir(A) = 0 is ^ 0. Evidently © is invariant relative to £2. 
Hence © = 9i and i r(A) = m(A). 

Now let £2 be an arbitrary commutative set of linear transfor- 
mations. We first decompose 9 X as 



91 = 9? (1) ® 9t (2) ® ••• © 9t (h) 



where the 9J (i) are indecomposable. It is clear that the transfor- 
mations induced by the A ei 2 in any invariant subspace and in 
any factor space commute. This holds in particular for the 9? (0 . 
Our result in the indecomposable case, therefore, shows that the 
minimum polynomial of any A acting in 9l (l) is a prime power 
TTi(\) k \ We now choose a composition series 

(26) 0 c dh (i) c 9t 2 (i) c • • • c = 9* (i) 

for each 9t (l) . Then each factor space 9 f i/ l) /9 f l/_i (l) is irreducible 
relative to the induced transformations. Hence the minimum 
polynomial of the transformation A induced by A in 9 p J/ l) /9l;_-i (t) 
is irreducible. On the other hand, i Ti(A) ki = 0. Hence the mini- 
mum polynomial of A is 7 r* (X). 

We now choose a basis for 9? (t) corresponding to (26), that is, 
the first group of vectors is a basis for 9?i (i) , the second group 
supplement these to give a basis for 9?2 (l) > etc. The bases thus 
determined for the different 9t (i) constitute a basis for 9?. With 
respect to this basis the matrix of A has the block form 
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f(«l) 



(« 2 ) 



(27) 



where the block (a,-) that goes with 3{ (i) has the form 

:(Pi) o 

(182) 

(28) 



(A,)J 



The matrix (/3y) is a matrix of the induced linear transformation 
A. Hence its minimum polynomial is tt; (X). 

Suppose now that 3> is algebraically closed. Then 7r* (X) = 
X — Hence A = p*l is a scalar multiplication in i (i) - 

Since any subspace is invariant relative to a scalar multiplication, 
it follows that 9l/ l) /9?y_ i (l) is one dimensional. Hence the ma- 
trices (j8y) are one-rowed. Our final result can therefore be stated 
as the following theorem on matrices: 



Theorem 7. Let 3> be an algebraically closed field and let co be 
a set of commutative matrices belonging to <t> n . Then there exists a 
non-singular matrix (f) in <£> n such that 



(m)(«)(m) 1 



'(«l) 

(« 2 ) 



where 



(«n)J 



(«i) 



Pi 



Pi 



01 



p*-J 



/or all (a). 
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EXERCISES 



1. Let $ be the field of complex numbers and let 



•1 


0 


0- 




■ 0 


1 


1' 


0 


0 


-1 


08) = 


-1 


1 


-1 


.0 


1 


2- 




. 1 


1 


3. 



Verify that (a)(/3) = (&)(<*) and illustrate Theorem 7 with this pair of matrices. 

2. (Ingraham) Let A be a matrix in the block form 



’An 


A\2 


’ ’ A\m 


A 21 


A 22 


' * A2m 


-Afni 


A m 2 


' ' A mm- 



where the A a are r X r matrices with elements in an algebraically closed field. 
Assume that the An commute with each other and define 



detfl A = 2 ± Au l A2i 2 * • * A m i m 

where the sum is taken over all permutations ii , h> • • •, i m of 1, 2, • • •, w and 
the sign is + or — according as the permutation is even or odd. Use Theorem 7 
to prove the following transitivity property of determinants 

det(det# A) = det A . 

(This holds for arbitrary base fields. Compare Sec. 9, Chapter VII.) 

3. (Schur) Show that the maximum number of linearly independent commu- 

r» 2 i 

tative matrices which can be chosen in <£ n is — + 1, where, in general, [a] 

denotes the greatest integer in the real number a. 
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This chapter is devoted to the study of certain types of func- 
tions, called bilinear, which are defined for pairs of vectors (x> y ') 
where x is in a left vector space 9i andjy' is in a right vector space 
9i'. The values of g(x,y') are assumed to belong to A, and the 
functions of one variable g x (y') = g(x,y') and g y >{x) = g(x>y') 
obtained by fixing the other variable are linear. Of particular 
interest are the non-degenerate bilinear forms. These determine 
1-1 linear transformations of 9t' onto the space of linear functions 
on 9t. Consequently, if A is a linear transformation in 9?, there 
is a natural way of associating with it a transposed linear trans- 
formation in 9?'. 

If the division ring A possesses an anti-automorphism, then 
any left vector space 9? over A can also be regarded as a right 
vector space over A. Hence in this case one has the possibility 
of defining bilinear forms connecting the space with itself. Such 
forms are called scalar products. Their study is equivalent to 
the study of a certain type of equivalence for matrices called co- 
gredience. The most important types of scalar products are the 
hermitian, symmetric and alternate scalar products. We shall 
obtain canonical matrices for such forms, and, in certain special 
cases that are of interest in elementary geometry, complete solu- 
tions of the cogredience problem will be given. We shall also 
prove Witt’s theorem for hermitian forms and apply it to de- 
fine the concept of signature for such forms over an arbitrary 
division ring of characteristic 2. 
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1. Bilinear forms. If 91 is a (left) vector space over A, we have 
defined a linear function as a mapping x — > fix) e A such that 

(1) f(x +y) = fix) +/0), /(ax) = af(x). 

These mappings form a right vector space 91* relative to the com- 
positions 

(2) (/ + g)( x ) =/(*) +£(*), (/«)(*) = /(*)«• 

As we have seen, these definitions imply that, for each fixed vec- 
tor x e 91, the mapping / — > x(f) = /( x) is a linear function in 
91*. Because of this symmetry it is natural to regard f(x) = 
x(f) as a function of the pair x e 91,/ e 91*. We therefore denote 
the value /(x) as six, /), and the above equations now read 

(3) s(x +y,f) = j(*,/) + s(y,f), s(ax,f ) = as(x,f) 

(4) *(#,/ + £) = •!(*>/) + s(x,g), s(x,/a) = j(*,/)a. 

We shall now generalize this situation by assuming that 91' is 
any right vector space over A. Then a function g(x, y') defined 
for all pairs (*, y'), x in 91, y' in 91', with values g(x, y') in A is 
called a bilinear form, if 

(5) £(*i + * 2 , /) = g(x i, y') + g(x 2 , y'), 

g(ax,y') = ag(x,y'), 

(6) g(x, y/ + y 2 O = g(x, y i') + g(x, y 2 '), 

g(x,y'a) = g(x, y')a. 

Clearly s(x,f) is a bilinear form for the space 91 and its conjugate 
space 91*. 

On the other hand, as we proceed to show, the conjugate space 
91* can be used to give an alternative definition of a bilinear form. 
First, let g(x, y') be a bilinear form connecting the left vector 
space 91 and the right vector space 91'. We fix the vector y' and 
regard g(x , y') as a function of x. Accordingly we write g(x, y') 
= gv’ (*)• Then, by (5), g y >(x) is linear, that is, it belongs to the 
conjugate space 91*. Now let y' vary and consider the mapping 
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y f — > g y r e 9?*. We denote this mapping as R and we observe 
that, by (6), R is linear, since 

gvi'+vA x ) = g(*>y i + y*) = g(*>y i') + y*) 

= + <sw(*)> 

sv«0) = g(*>y'<*) = g(*> y')<* = <gyOV 

In a similar manner, if we fix *, then the function g x (y') = g(x y y') 
is linear in y'. Hence g x is in the conjugate space (SR')* of 9 t'. 
The mapping L: x — > g x is a linear transformation of 9? into (9t')** 
Conversely, suppose that we are given a linear transformation 
R: y r — » ^(V) of the right vector space 9J' into the space 9i* of 
linear functions on 9?. Then we can regard g y >{x) as g(x,y '), a 
function of (*, jy')> x in ^ JV 7 in 9?', and we can verify that this 
function is a bilinear form. Thus we see that an equivalent defi- 
nition for the concept of a bilinear form is that of a linear trans- 
formation of the right vector space 9i' into the conjugate 9J* of 
9t. Similarly, we could also say that a bilinear form is a linear 
transformation of 9? into the conjugate space (91')* of 9 l'. The 
original definition, however, has the advantage of symmetry over 
the present formulations, and it will be given preference in the 
sequel. 

EXERCISE 

1. Show that, if g(x,y') is a bilinear form and A is a linear transformation in 
9t, then g(xA , y') is a bilinear form. 

2. Matrices of a bilinear form. Suppose now that 3? and 
are finite dimensional and that (ei, e 2 , •••, e n ), (//, /■/, • • • , 

/„»'), respectively, are bases for these spaces. We shall call the 
matrix 

g(e u fi) g^uff) ••• g(e u f n > r ) 

g(e 2 ,fi) g{e 2 ,f 2 ) ■■■ g(e 2 yf n >') 

g(en,fl) g{e n ,f 2 ) ■ ■ ■ g{e n yjn’ 0 

the matrix of the bilinear form g relative to the given bases. The 
form is completely determined by this matrix; for if x and y' 
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n n' 

are arbitrary in 9 1 and 9?', we can write x = ^ &*», jy' = 'Hf/ r }jy 

i i 

and we obtain 

£(*>/) = 2 //^-) = 

Hence g(#, jy') is known from the representations of x and jy' and 
from the entries g(e iy f/) of (7). Also it is clear that, if (&y) is 
any n X n r matrix with elements in A, then there exists a bi- 
linear form h(x y y') which has this matrix as its matrix relative 
to Oi, e 2y • • •, e n ) y * * - ,/n' 0; for we can define 

/) = 2 ZiPiMj 

and it is easy to verify that this function is bilinear. Since 
g{e% y f/) = Pij the matrix of g is the given one. 

We consider now the effect on the matrix (/ 3 ) of g(x y y') of 
changes of bases in the two spaces. Let (u ly u 2y • • •, « n ) be a 
second basis in 9t, Ui = 2/x^y where (/*) eL(A, ri) y and let {vi y 
v 2 y • • •, zvO be a second basis in 9 y v k = 2//^*, (V) e L(A, w'). 
Then 

g(Ui y V k ) = gi^fJ'ij^j) 2/* £ J'Zfc) = 

so that the new matrix is 000000 a matrix equivalent to (/3). 
We remark that in the commutative case this relation is usually 
encountered in a slightly different form. Here one generally con- 
siders all vector spaces as left vector spaces. Then the change of 
basis in 9i' reads v k — ^v k ifi , and the new matrix of the bilinear 
form is where ( v) f is the transpose of the matrix 

O') = 0 '«)• 

We return now to the general case of an arbitrary A. We have 
proved in Chapter II (p. 45) in connection with the theory of 
linear transformations that, if (0) is any rectangular matrix with 
elements in A, then there exist non-singular square matrices (fx) 
and O') such that 

(8) 0008)00 = diag {1, •••, 1,0, 

This matrix result yields the following fundamental theorem on 
bilinear forms. 
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Theorem 1. If g{x y y f ) is a bilinear form connecting the vector 
spaces 9? and 9i', then there exist bases (u u u 2y • • *, u n ), (vf y v 2 \ 
• • - , v n ' ') for these spaces such that 

^ s(»b v/) = bij if ij = 1 , 2, • • •, r 

g(u iy v/) =0 if i > r or j > r. 

Evidently the number r is the rank (row or column) of the 
matrix (0) of g(x> y'). An abstract characterization of this num- 
ber will be given in the next section. 

3. Non-degenerate forms. If g(x y y f ) is a bilinear form, the 
totality of vectors z e 9? such that g(z> y') = 0 for all y ! e 9t' is 
a subspace 9J: which we shall call the left radical of g. Similarly 
we define the right radical 91' as the subspace of 9 1' of vectors z f 
such that g(x> z') = 0 for all x. It is clear from the definition 
that the left (right) radical is just the null space of the linear 
transformation L (R) of 9? (9? r ) into the space of linear functions 
on 9 V (9?). If {fluffy * * * ,/n'O is a basis for 9 V and z e 9f, then 
g(z,f/) = 0 for j = 1 , 2, • • •, n' . Moreover the n' equations 
g( z >ff) — 0 imply that g(z> x f ) = 0 for all x'. Hence these con- 
ditions are also sufficient that z e 9J. Now let (ei, e 2y • • •, e n ) be 

n 

a basis for 9? and write z = ^ f *<?*•. Then 

i 

(10) 0 = g{zjf) = g&UeiJf) = Z fAy, j= 1, 2, • ■ •, n\ 

i=i 

are the conditions that z = e 9i. It follows that dim 9t is 
the maximum number of linearly independent solutions of (10). 
Hence dim 9f = n — r where r is the rank of (0). Similarly we 
see that dim 9!' = n f — r. We remark also that, if we use the 
normalized bases given in Theorem 1, then we see that 91 = 
[u r + 1 , U r+2y • • •, u n ] and 9f' = [v r +i, v r+2 ' y • • •, v n /]. For it is 
clear that the Uj with j > r are in 9i. Moreover, if e 9f, 

then, in particular, for k < r, 

0 {Uiy Ufa*) T&* 

n 

Hence the vector has the form ^2 yjUj , and therefore it belongs 

r-j-l 

to [u r+ 1 , u r+ 2 , • • •, a n ]. This proves the assertion about 91. A 
similar argument applies to 91'. 
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We know that dim dtL + dim 9£ = n and dim 9J'i? + dim 5ft' 
= n f . It follows that dim 9?Z, = r = dim SR'i?. 

The bilinear form g is said to be non-degenerate if = 0 and 
9T = 0. Thus g is non-degenerate if the only vector z in 9i such 
that g(z y y') = 0 for all y f is 2 = 0 and the only vector z' in 9i' 
such that g(x y z f ) = 0 for all x is z r — 0. If g is non-degenerate, 
then the mappings L and R are 1-1. Also since dim 91 = 0 = 
dim 9T, then n = r = n* . Thus we see that, if and 9 V are con- 
nected by a non-degenerate bilinear form, then these spaces have 
the same dimensionality n. We recall also that a space and its 
space of linear functions have the same dimensionality. Hence 
the mappings L and R are 1-1 linear transformations of vector 
spaces into vector spaces of the same dimension. It follows that 
these are mappings onto the corresponding spaces. The relations 
n = r — n' are also sufficient that the bilinear form be non-de- 
generate; for they clearly imply that dim 91 = n — r — 0 and 
dim 9T = n r — r = 0. Our results can be summarized in the fol- 
lowing 

Theorem 2. Necessary and sufficient conditions that a bilinear 
form g(x,y') connecting 9? and 9 V be non-degenerate are\ 1) 9? and 
9 V have the same dimensionality ; 2) the matrix of the form relative 
to any pair of bases is non-singular. If g is non-degenerate and 
fi(y') IS a linear function on 9i', then there exists one and only one 
vector x in 9 1 such that g(x,y f ) = f\(y f ) holds for all y f . Also if 
/ 2 (V) is any linear function on 9J, then there is a unique vector y' 
in 9t' such that g(x> y r ) = / 2 (#) holds for all x. 

If two vector spaces 9tand 9 V are connected by a non-degenerate 
bilinear form g y then we shall say that these spaces are dual rela- 
tive to g. Suppose that this is the case and let (e ly e 2y • • *, e n ) 
be a given basis in 9 1. We choose a basis (fiyff* * * ‘>fn') for 
91', and we obtain the non-singular matrix (0) where fin = g{en 
//). Now let (y) be any non-singular matrix in A n and let vf = 
S/iVi/fc be the corresponding new basis in 9 {'. Then we have seen 
that the matrix of g(x,y') relative to the pair of bases (e ly e 2 > 

• • *, e n ) y ( vfy v 2y • • •, v f) is (fi)(v). Thus there exists a uniquely 
determined basis (ef, e 2 , • • •, e n f ) for 9?' such that the matrix of 
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g relative to (e ly e 2 , •••, e n )> ( ei\ e 2 , ••*, e n f ) is the identity 
matrix, that is, 

(11) g(e iy e/) = 8 ijy i y j = 1, 2, •••,». 

The e/ are obtained by taking ( v ) = (/3) -1 . We call the bases 
(eiy e 2y • • *, ^ n ), (^i', e 2 y • • *, ^ n ') complementary bases for the dual 
spaces. Our argument shows that any basis in 9t (90 determines 
a unique complementary basis in 9T (9t). 

Complementary bases have been used in our study of the no- 
tion of incidence between subspaces of 9 1 and of its conjugate 
space 9J*. These spaces are dual relative to the fundamental bi- 
linear form s(Xy /) = /(*); for if f(x) = 0 for all x y then / = 0 
by definition, and if x 9 ^ 0, then there exists a linear function / 
such that f(x) 0. We can carry over to the general case the 
results on incident spaces. Thus let 9? and 9t' be dual relative to 
g( x >y')- Then if © is a subspace of 9f, we define the space j(©) 
incident to © to be the subspace of 9t' of vectors y' such that 
g{x y y') = 0 for all x e ©. In a similar manner we define j{&) 
for a subspace ©' of 90 We can prove as in the special case of 
9 1 and 9f* (p. 55) that dim y(©) = n — dim © and dim^/X©') = 
n — dim ©'. These relations, together with the obvious rela- 
tions 7 OX©)) 2 ©>iO(©0) 3 imply that 

iO((©)) = ©, jOW)) = 

Also, as in the special case of the space of linear functions we can 
prove that the mapping © — » j(<&) is an anti-automorphism of 
the lattice of subspaces of 9i onto the lattice of subspaces of 90 
The inverse of this anti-automorphism is, of course, the mapping 
©' — > j(&). We leave it as an exercise to the reader to fill in 
the details of this discussion. 

EXERCISE 

1. Let 9? and 9{' be connected by the bilinear form g(x,y') and let 91 and 91' be 
the radicals determined by this form. If x + 91 8 9J/9J and y f + 9t' e 9?'/9i', set 
g(x + 9t, jy' H- 91') = g(Xjy f ). Show that this defines a non-degenerate bilinear 
form for the factor spaces 9?/9f, 9t'/90 

4. Transpose of a linear transformation relative to a pair of 
bilinear forms. Let 9?,-, i = 1, 2, be a left vector space and let 
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91/ be dual to 9t» relative to the bilinear form gi(xi,y/). If A is 
a linear transformation of 9?i into then we know how to de- 
fine the transpose A * as a linear transformation of the conjugate 
space 9? 2 * into the conjugate space 9fi*. If/e 9t 2 *> then /A* is, 
by definition, the resultant Af. We can now make use of the 
equivalences Ri of 9?/ onto to define a linear transformation 
of 9? 2 ' into SR/, namely, we set 

(12) A' = R 2 A*RrK 

Clearly this product can be defined since we have the following 
scheme of linear mappings: 

R 2 : 8t 2 ' -> $ 2 * 

A *: $R 2 * -> 3ti* 

Rr 1 : -> SRi'. 

Also it is clear that A' is a linear transformation of 9t 2 ' into SR/. 
We shall call this transformation the transpose of A {relative to the 
bilinear forms gi and g 2 ). 

We determine next the form of A'. Let y 2 e 9t 2 '. Then y 2 R 2 
is the linear function g 2 {x 2 ,y 2 ) on the space $ft 2 . Moreover, 
y 2 'R 2 A* is the linear function f x on 9ti such that 

(13) /i(*i) = g2(*iA y y 2 '). 

Finally y 2 'i? 2 ^*i?i“ 1 is the vector yf of 9?/ such that 

(14) /i(*i) = g2(* 1 A,y 2 ') = giixuyi) 

holds for all Xi in 9ti. The mapping A f sends y 2 into yf and, 
according to the above equation, yf = yf A r is the uniquely de- 
termined vector of 9?/ such that 

(15) giix^yfA') =g 2 (xiA y y 2 ') 
holds for all Xi in 9ii. 

Since the notion of duality is a symmetric one, we can inter- 
change the roles of 9 U and 9 in the foregoing discussion. Thus 
if A* is a linear transformation of dt 2 into 9?/, then we define its 
transpose (relative to the given forms) to be the mapping A n = 
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LiA f *L 2 ~ l where Li is the fundamental equivalence of 9ii onto 
9V*> A r * is the transpose of A f , and L 2 is the equivalence of 
yt 2 onto 9V*- In analogy with (15) we can verify that Xi A n 
is the uniquely determined vector such that 

(! 6 ) g2(xiA",y 2 ') = giixuy^A') 

holds for all y 2 in $t 2 '- Now it is clear from (15) and (16) that, 
if A' is the transpose of A> then A is the transpose of A'. Also if 
A" is the transpose of A\ then A' is the transpose of A" . Thus the 
two correspondences A — > A * , A ' — > A n are inverses of each 
other. Hence A — » A r is a 1-1 mapping of the set ?(9ii, St 2 ) of 
linear transformations 9?i into 9J 2 onto the set 9J/) of 

linear transformations of 9 \ 2 into 9 l/- Of course, this can also 
be seen directly from the definition of the transpose and the 
properties of the mapping A — > A ' . 

The algebraic properties of the mapping A — > A' can also be 
deduced from the previously established properties of the map- 
ping A — > A *. We state these without proofs: 1) If B is a sec- 
ond linear transformation of 9?i into 9 1 2 , then 

(17) (A + BY = A' + B'. 

2) If 9?3 and 9V are dual relative to the non-degenerate form 
^ 3 (^ 35^30 and C is a linear transformation of 9 1 2 into 9i 3 , then 
we denote the transpose of C relative to g 2y g 3 by C\ Also we de- 
note the transpose of AC relative to gi> g 3 by (. AC)' . Then we 
have the relation 

(18) (ACY = CA f . 

3) As usual if we deal with a single vector space 9? and its dual 
relative to gix^y’), then 1) and 2) show that the mapping A — * 
A\ the transpose of A relative to g , is an anti-isomorphism of the 
ring of linear transformations in 9f onto the ring of linear trans- 
formations in 9t'. 

Finally, we wish to determine the relation between the ma- 
trices of a linear transformation and its transpose. Again let A be 
a linear transformation of 9ii into 9?2 and let 9ti' and 9t 2 ' be the 
duals relative to the forms £1 and g 2 respectively. Let (e ly e 2 , 
• • •, O be a basis for 9?i; (tf/, e 2 , • * •, £ n /), the complementary 
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basis in Similarly let C/ 1 ,/ 2 , • ■ - ,/„ 2 ) be a basis for 9? 2 and 

(fiyffy • • • , /„/) the complementary basis. Suppose that 

n 2 

= &ikfk) i — Xy'Zy * y 71 \ 
k = l 

n\ 

fi'A' = EejW, /= 1 , 2 , 

i— i 

Then the conditions 



yield the relations 



g 2 {e i Ayf l f ) = gMyfM') 



Oiil = a<z' 



for the matrices. Thus as in the special case of spaces and the 
dual spaces of linear functions, we have the rule: If complementary 
bases are used in the dual spaces , then the matrix of a transformation 
A and of its transpose A* are equal. 



EXERCISE 

1. Supply proofs of statements 1), 2) and 3) above. 

5. Another relation between linear transformations and bi- 
linear forms. We assume again that the spaces 9 1; and 9 ?/, / = 
1, 2, are dual relative to g im Let u r and v be fixed vectors in 9V 
and 9t 2 respectively. Then if x ranges over 9ti, the mapping 

(19) X -> gfXy u')v 

is a linear transformation of 9? i into 9?2- This is clear from the 
properties of the bilinear form. Since our transformation is de- 
termined by the pair u\ v , we shall denote it as the “product” 
u r X v. More generally if uf y u 2 , • • •, u^f are in 9?/ and v 2y 

• • •, v m are in 9I 2 , then the mapping uf X V\ + u 2 X v 2 H 

+ Uyf X v m is the linear transformation 

X -» gi(x y Ui)Vi + gi(x y u 2 )v 2 H h gi(x y u m ')v m . 

We show next that any linear transformation of 9?i into 9?2 
has this form. For let A e?(9li, dt 2 ) and let (vi> v 2y • • •, v r ) be a 
basis for the rank space dt\A. Then if x is any vector in 9ii, xA 
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can be written in one and only one way in the form 

(20) xA = (j>iVi + <j> 2 V 2 + # • • + <l>rVr 

where the coefficients 0* are in A. Since these coefficients are 
uniquely determined by x> we may consider them as functions of 
x y and accordingly we write 0* = <f>i(x). Thus (20) can be re- 
written as 

(20') xA = <t>i(x)vi + 4> 2 (x)v 2 H b 4>r(x)v r - 

If y is a second vector in 9?i, then 

20,-(* + y)vi = (x + y)A = xA + yA = 2 4>i(x)vi + 2 0*(yK-. 

Hence <t>i(x + y) = <t>i(x) + 0*(jy). Also by (20'), if a e A, then 

{ax') A = a{xA) = a<f>i{x)vi + a<f> 2 (x)v 2 +• • • + a4> r (x)v r y 

and this implies that <!>i{ax) = a<f>i{x). Hence the functions <f>i(x) 
are linear. Since g\{x y y f ) is a non-degenerate form, it follows 
that there exist vectors u{ in dti such that </>i(x) = gi(Xy u/) 
holds for all x. Thus 

xA = gx(Xy U 1 ')v 1 + gx(Xy U 2 )v 2 4 b gl(x y U r ')v r 

and A = Ui X v x + u 2 X v 2 + • • • + u/ X v r as required. 

The pairs of vectors («i', Vi) y {u 2y v 2 ), • • - , (u m ' y v m ) also define 
a linear transformation A r of 9t 2 ' into 9?i'. If x' & 9t 2 ', then we 
define 

x'A' = Uig 2 (v 1 , x') + u 2 g 2 {v 2 y x f ) H b u m 'g 2 {v m y X '). 

We shall denote this transformation as 

X' Vi + U 2 X' v 2 b Urr! X' 0«. 

As is indicated by our notation the transformations A and A 1 
are transposes relative to the bilinear forms £1 and g 2 ; for if x 
is any vector in and x f is any vector in 9 ? 2 ', then 

gi(x> x'A') = gi(x, X) Uig 2 {vi, x')) = Ui)g 2 (vi, x') 

and 

g 2 (xA, X') - g 2 (y^gi{x, Ui) Vi, X'^ = £)gi(x, Ui')g 2 (Vi, x'). 
Hence g\{x y x'A') = g 2 {xA , x'), and this proves our assertion. 
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We consider again the representation of the linear transforma- 
tion A as «/ X It is clear from the definition of the product 

i 

u! X v that 

(21) (ui + u 2 ) X v = Ui X v + u 2 X v, 

u f X (^i + v 2 ) = u' X Vi + u! X v 2 

(22) u'cl X v = u r X av. 

Hence if we express the u/ as u/ = Xe/Pji y then 2«/ X Vi = 
EefPji X Vi = 2<?/ X PjiVi = 2e/ X w$ where wj = 2j8y<0<. Thus 
if we can write the u/ , i = 1, 2, • - • y m y in terms of fewer than m 
vectors e/ y then we can obtain expressions for A = 2 X as 
a sum of fewer than m products e / X Wj. Similarly if we can 
write the Vi as V{ = 2a*y/y where the number of/’s is less than w, 
then we can obtain an expression for A as a sum of less than m 
products. 

r 

Now let xf = ^ u i X be an expression for A for which r is 

i 

minimal. Then our remarks show that the sets (u \ , u 2y • • •, u r r ) 
and (v u v 2y • • •, v r ) are linearly independent. Now there exist 
vectors ( u ly u 2y • •, u r ) in 9ti such that 

Th */) = = 1, 2, • • -, r. 

UiA = «/)»y = Vi 

3 

and [»x, o 2 , • • •, o r ] £ On the other hand, for any x 

XA = £ gx{x, u/)Vj 8 [Pi, P 2 , • • Or]- 

3 

Hence 9ti A = [vi y v 2y • •, p r ], and r is the rank of A . If we use 
the form of the transpose determined above, we see that the rank 
space dt 2 A f = [ui y u 2 y • • •, «/]. We remark that this gives still 
another proof of the theorem that rank A = rank A r . 

6. Scalar products. We know that, if dt is a left vector space 
over a field $, then 9J can also be regarded as a right vector space 
over $. Hence we have the possibility of defining bilinear forms 
connecting 9? with itself and of regarding 9t as a dual of itself. 
These considerations can also be applied to the division ring case, 
provided that A possesses an anti-automorphism. 
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Thus suppose that A is a division ring in which there is defined 
an anti-automorphism a — > a. The defining properties of such 
a mapping are that it is 1-1 of A onto itself and that it satisfies 
the conditions 

(23) a + = a + /3, a(3 = (3a. 

Thus, for example, A can be taken to be the division ring of 
Hamilton's quaternions and a — > a the mapping of a quaternion 
onto its conjugate. We remark also that, if A = $ is a field, 
then an anti-automorphism is just an automorphism in <3>. Hence 
in this case a — » a can be taken to be any automorphism of $>. 
In particular, we can take a = a, that is, the mapping is the 
identity automorphism. This is the classical case, and it will re- 
ceive special emphasis in our discussion. 

Let 9? be a (left) vector space over A. By using the given anti- 
automorphism a — > a it is easy to turn 9? into a right vector 
space over A. We have merely to set xa = ax> or, in other words, 
if a — > a A is the inverse of a — > a, then xa = a A x . One verifies 
that this definition of right multiplication by scalars satisfies the 
basic requirement (cf. p. 6). Hence 9i becomes, in this way, a 
right vector space over A. The two dimensionalities, left and 
right, of 9? over A are equal. In fact, it is clear that, if (e iy e 2y 

• • • , e n ) is a left (right) basis for 9J, then it is also a right (left) 

basis; for, if x = then x = He^i. Moreover, if Heidi = 0, 

then H8i A ei = 0, b A = 0 and 5i = 0. We remark also that any 
left subspace of 9? is a right subspace and conversely. 

We can now consider bilinear forms connecting the left vector 
space 9 1 and the right vector space 9?. Such forms will be called 
scalar products . Thus by definition a scalar product is a function 
g{x y y) defined for pairs of vectors ( x , y ) belonging to 9t, having 
values in A, and satisfying the equations 

(24) g(x! + * 2 , y) = g(x u y) + g(x 2 , y), g(ax, y) = ag(x, y) 

(25) g(x,y i +y 2 ) = gfayj + g(x,y 2 ), g(x, ay) = g(x,y)cr, 

6 

for since ay = ya y the second condition in (25) is equivalent to 
the second part of (6). 
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If (eiy e 2y • • •, e n ) is a basis for 9? over A, the matrix (/3), /? y = 
g(ti> ej) is called the matrix of the scalar product relative to this 
basis. Evidently this concept is a specialization of our former 
one of the matrix of a bilinear form determined by bases in the 
spaces connected by the form. As in the general case, the matrix 
and the basis completely determine the function g(x> y); for if 
x = and y = then by (24) and (25) 

(26) g(x, y) = J2 hfiaVi- 

Conversely, if (e iy e 2 , • • *, e n ) and (£) are given, then the equation 
can be used to define a scalar product in 9t. 

We consider now the effect of a change of basis on the matrix 
of a scalar product. Let (/i, f 2y • • • , f n ) be a second basis of 9? 
and suppose that /*• = 2/x^y. Then if we regard these as right 
vector spaces, the relation between them is fi = XejVji where vji 
= pLij. Thus the matrix connecting the right bases is ( v ) = {jX)' y 
and this matrix is called the conjugate transpose of the matrix 
(/*)• Now we have seen (p. 139) that the new matrix of g{x y y) 
is (7 ) = (m)(/ 3) O'). Hence we have the relation 

(27) (7) = (m)G8)(m)' 

where (/*) is the matrix that gives the change of left bases. Two 
matrices (/3) and (7) connected as in (27) by a non-singular matrix 
(ju) are said to be cogredient (relative to the given anti-automor- 
phism). It is easy to see that this relation is an equivalence. 
The result we have established is that any two matrices of a 
scalar product are cogredient. It is also clear that, if (/3) is the 
matrix of a scalar product, then any matrix cogredient to (/ 3 ) is 
also a matrix of the scalar product. 

If © is a subspace of 9?, then it is evident that the contraction 
of the function g(x , y) to the pairs of vectors in © is a scalar prod- 
uct for ©. It is now natural to introduce the following notion of 
equivalence between subspaces of 9 1. We say that the subspaces 
©x and © 2 are g-equivalent if there exists a 1-1 linear transforma- 
tion U of @1 onto © 2 such that £(.Vi,jyi) = g&iU, yiU) holds for 
all Xiy y 1 in ©1. If (e ly e 2y • • •, e r ) is a basis for ©1, then (ei U y 
e 2 U y •••, e r U ) is a basis for © 2 , and g(e iy ej) = g(eiU y ejU). 
Hence the matrices of the contractions of g relative to these 
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bases are identical. It follows that, if arbitrary bases are chosen 
in ^-equivalent subspaces, then the matrices determined by these 
bases are cogredient. Conversely, suppose that ©i and © 2 are 
subspaces such that the matrices of the contractions of g to these 
spaces are cogredient. Then by an appropriate choice of basis 
in ©2 we can suppose that we have a basis t, e 2 , • • • , e r ) for 
©i and a basis (/i,/ 2 , • • *,/ r ) for © 2 such that it, e 3 ) = i(A/y). 
Now let Xi = 2 yx = 2^*/ be any two vectors in © x . We 
have £&?(*»•> e,)rij = E ?#(/»> />)% and tflis S ives s( x uJi) = 

i,j *,3 

g(^i/b Zrijfj). Now the mapping x x = 2 2f ? -/,- is a 1-1 
linear transformation of ©i onto © 2 . Hence the foregoing rela- 
tion shows that ©i and © 2 are ^-equivalent. 

7. Hermitian scalar products. We shall assume now the anti- 
automorphism a — > a is involutorial in the sense that a = a for 
all a. We consider first the theory of scalar products which are 
hermitian in the sense that 

(28) g(y, x) = g(x, y ) 

for all x and y. If A = <£ is a field and a = a (this is allowed), 
then the hermitian condition reads 

(29) g(y, x) = g{x,y). 

A scalar product satisfying this condition is said to be symmetric . 

If e 2 y ' ' ' y tn) is a basis for over A, then the condition 
(28) implies in particular that 

Pij = it, ft = it, ft = ft-,*. 

Hence a necessary condition that g(x y y) be hermitian is that the 
matrices (ft of the scalar product be hermitian in the sense that 

(30) (ft' = (ft. 

This condition is also sufficient; for, if it holds and x = 2 and 
y — 2 t 7 ,a-, then 

g(y y ft = == 2?7 »ftif/ 

and 

g(Xy y) = = 2?7 

Hence g(y y ft = g(x y y). In particularity) is symmetric if and 
only if its matrices are symmetric ((ft' = (ft). 
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If g{x y y) is hermitian, g(u y v) = 0 for the particular vectors 
u y v implies g(v y u ) = g{u y v ) = 0. If this holds, then we say that 
the vectors u and v are orthogonal (relative to g). Our remark 
shows that this relation is a symmetric one. If © is a subspace 
of 9?, we define the orthogonal complement © x to be the space of 
vectors v that are orthogonal to every vector we©. This coin- 
cides with our earlier definition ofj(©). It should be noted that, 
except in important special cases, some of which will be consid- 
ered in the next chapter, ©- 1 - is not in general a complement of 
© in the lattice of subspaces. 

The subspace 9J -1 of vectors z that are orthogonal to every 
x e 9J will, as before, be called the radical of g(x y y). The scalar 
product is non-degenerate if its radical 9J 1 - = 0. In this case we 
know that the mapping © — » ©- 1 is an anti-automorphism of 
the lattice of subspaces onto itself. Also we know that dim © x 
= n — dim ©. Hence dim ©■ L± = dim ©. Since it is clear that 
@- LJ - Z) ©, this relation gives © J " J " = ©. Thus the mapping 
© — » ©- 1 determined by a non-degenerate hermitian scalar prod- 
uct is an involutorial mapping in the lattice of subspaces. 

A subspace © will be called isotropic if © fl ©^ ^ 0. This 
evidently implies that © contains a non-zero vector u which 
is isotropic in the sense that g{u y u) = 0. A subspace © will be 
called totally isotropic if © C ©- 1 . 

If g is non-degenerate and © is a non-isotropic subspace, then 
© H © x = 0 and dim ©- 1 = n — dim ©. Hence in this case we 
have the decomposition 9t = © ® ©- 1 . 

EXERCISES 

1. Show that the existence of a hermitian scalar product 0 implies that the 
anti-automorphism a — > a is involutorial. (Hence the latter condition is super- 
fluous in the above discussion.) 

2. Show that the existence of a scalar product j*0 which satisfies g(x,y) = 
g(y> x ) implies that a = a and hence that A is a field. 

3. Show that, if (a) and (/3) are hermitian, then so are ( a ) db (/3), (a) r y («)(/?) + 

raw. 

4. A scalar product h(x y y) is called skew-hermitian if h(y y x) = —h(x y y). Let 
fx be an element of A that satifies Ji = —fi 0 and let h{x y y) be skew-hermitian. 
Prove that the mapping £ — » £* = jn _1 £/x is an involutorial anti-automorphism 
in A and that g{x y y) — h(x y y)n is hermitian relative to this anti-automorphism. 
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8. Matrices of hermitian scalar products. In our discussion 
of hermitian scalar products we shall find it convenient to treat 
separately the symmetric forms over fields of characteristic 2. 
We therefore assume in this section that, if g is symmetric, then 
the characteristic of A = <£ is not two. 

We consider first the function g(x , x) of a single vector x de- 
termined by the scalar product. If g(x y y ) is symmetric, the as- 
sociated function g{x y x) is called the quadratic form determined 
by the symmetric form. In general, we shall say that the ele- 
ment /? of A is represented by the scalar product if there exists a 
vector u 0 such that g{u y u) = /3. Since 

g (u y u) = g(u y u) y j8 = /3. 

Hence the elements represented by the scalar product are invari- 
ant under the anti-automorphism. An important step in proving 
the main result on hermitian scalar products is the following 

Lemma. If g{x y y ) is a hermitian scalar product 0, then there 
exist non-zero elements which are represented by the scalar product . 
{In other words , if is not totally isotropic y then it contains a non- 
isotropic vector.) 

Proof. If the conclusion is false, g{u y u) = 0 for all u. Hence 

g(x, y) + giy, x) = g(x + y, x + y) - g(x, x) - g(y, y) = 0 

for all *•, y. Since g(y, x) = g(x, y) this gives g(x, y) = -g(x, y). 
Now since g{x y y) 0, there exist vectors u y v such that p = 
g{u y v ) ^ 0. If we replace u by p~ l u and change the notation, 
then we can suppose that g{u y v) = 1. Then for any a in A, 
g{au y v) = —g{au y v) and a = —a. Since 1 = 1, this implies 
that the characteristic is two and that a = a. Hence our anti- 
automorphism is the identity mapping and A = $ is commuta- 
tive. This case is ruled out by assumption. 

The argument we have just used is not a constructive one. 
However, in the special case in which g{x y y) is symmetric, we 
can easily give such a method to find a vector u such that g{u y u) 
0. Thus let {e Xy e 2 > • • • , e n ) be a basis for 9 $ over 3>. Then 
g(ei y ej) 9^ 0 for some pair d y ej. If / = j y we can take u = ei. 
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Otherwise, we can assume that g(e iy ei) = 0 = g(e 3y e 3 ) and 
g(e iy ej) ^ 0 for / ^ j. Then g(a + e jy e { + e 3 ) = g(e iy e x ) + 
g{e iy ej) + g(e jy e { ) + g(e jy e 3 ) = g(e iy e 3 ) + g(e jy e % ) = 2 g(e iy e 3 ) 

9 ^ 0 . 

We now return to the general case and we shall prove the fol- 
lowing 

Theorem 3. If g(x y y) is a hermitian scalar product y then there 
exists a basis ( U\ y u 2y • • •, u ry z ly z 2y • • - y z n _ r ) such that 

(31) g{u iy Ui) = Pi 5* 0 y i = 1, 2, • • •, r y 
and all other products are 0. 

Proof. The result is trivial if g = 0; for then any basis serves 
as a set of z’s. If g 9 * 0 y we can take Ui to be any vector such 
that g(ui y Ui ) = Pi 9* 0. Such vectors exist by the Lemma. 
Now suppose that we have already determined linearly inde- 
pendent vectors {u\ y u 2y • • Uk) such that g(u{ y ul) = Pi 9 £ 0 
and g{ui y Uj) = 0 if / 9 * j . We introduce the mapping E k de- 
fined by 

k 

(32) x -» g(x, u l )fii~ l u i . 

1 

Clearly E k is linear and maps 9? into the space ©* = [u ly u 2y • • •, 
u k ] . Also UjE k = g( u h ul)Pi~ l Ui = u 3 . Hence E k is the iden- 

i 

tity mapping in It follows that E k 2 = E k so that, if we set 
F k = 1 — E ky then = ©& © dtF k . Also we have 

g(xE k , Uj) = g ( 2 g( X > Uj 

M=1 
k 

= E g( x > u >) 

i = 1 

= g(x, Uj). 

Hence g(xF k , uj) = £(x(l - E k ), uj) = g{x, uj) - g(xE k , uj) = 0. 
Thus the vectors in dlF k are orthogonal to every vector in 
that is, ?ftF k c ©i,- 1 -. We consider the scalar product g(x, y) in 
3tF k . If this is 0 in 3i F k , we choose a basis (zj, z 2 > • • •, z m ) for 
Since 3J = ©*© 3tF*, (u u u 2) • • •, u k , z u z 2 , • • •, z m ) is a 
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basis for 9?. Since the z’s are orthogonal to the us and to the z’ s, 
this is a basis of the required type. On the other hand, if g 5 * 0 
in dtFky then we can find a vector u k+ i in this space such that 
g(u k+ 1 , u k + 1 ) = i8 fc+ i ^ 0. Then {u u u 2 , • • «*+i) is a linearly 

independent set and, since u k+ 1 is orthogonal to the other 
the new set satisfies the same conditions as (u u u 2y ••*, u k ). 
The process can then be repeated. 

The method that we have given for finding a normalized basis 
is in essence due to Lagrange. It should be noted that Pi can 
be taken to be any element that is represented by the scalar prod- 
uct. Also we note that any Ui can be replaced by y 7 i ^ 0. 
This replaces Pi by P/ = g{yiU iy y iUi) = yiPai- In carrying out 
the above process it is necessary to have a basis for 9 \F k . This 
can be done by supplementing (#i, u 2y • • *, u k ) to a basis {u l5 
u 2 y • • *, u k , v ly v 2y • • •, for 9t. Then the viF k form a basis 

for 9 3F k . 

EXERCISES 

1. If $ is the field of rational numbers, find a diagonal matrix (/3) cogredient to 

'-2 3 5' 

(a) = 3 1 -1 • 

.5-1 4. 

Also find a non-singular matrix Qjl) such that (jjL)(a)(p) f = (/3). 

2. Show that the space [zi, Z 2 , • • •, z n _ r ] given in Theorem 3 is the radical of 

gfay)- 

3. Suppose that has characteristic ^ 2 and that y) is a symmetric non- 
degenerate scalar product in 9t over Prove that, if g represents 0, then g is 
universal in the sense that it represents every element of 

9. Symmetric and hermitian scalar products over special divi- 
sion rings. The foregoing discussion reduces the problem of co- 
gredience for hermitian matrices to that of finding conditions for 
cogredience of diagonal matrices. No general solution of this 
problem is known. The known results are all special in that they 
make use of special assumptions on the nature of A and of the 
anti-automorphism. For example, a complete solution of the co- 
gredience problem is known for symmetric matrices over the field 
of rational numbers. The theory for this case, which is due prin- 
cipally to Minkowski and Hasse, is arithmetic in nature and will 
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not be discussed here. Instead, our plan for the remainder of 
this chapter is as follows: In this section and in § 10 we consider 
the special cases of the cogredience problem which should be 
familiar to every student of mathematics. In the last two sec- 
tions we consider questions which are of interest primarily to 
specialists in algebra. Thus in § 11 we consider again the gen- 
eral theory of hermitian forms and we discuss Witt’s important 
generalization of the notion of signature. In the final section 
(§ 12) we consider the theory of symmetric scalar products over 
a field of characteristic 2. 

We specialize first the result of the preceding section to sym- 
metric matrices and symmetric scalar products (A = $ a field, 
a = a). We assume <£ has characteristic ^ 2. Suppose that 
g(x y y) is a symmetric scalar product and let (u ly u 2 , ••*, u n ) 
be a basis such that 

g(u iy u}) = dyPi, 1 3i 9 * 0, i = 1 , 2, • • *, r. 

Then r is the rank of the matrix diag {jSi, /3 2 , • • •, j3 n } of g(x y y) y 
and r is the common rank of all the matrices of this scalar prod- 
uct. As we have seen, we may replace Ui by Vi = y iU{ y y * ^ 0. 
Then the v’s form another basis and the matrix determined by 
this basis is 

diag {Pi, p 2 ', • • •, Pn'} 

where p/ = y i 2 Pi- Thus we see that any Pi can be replaced by 
P/ = ji 2 Pi, y i 0. Suppose now that is a field in which every 
element is a square. As usual, if y 2 = P, we write y = P^. Then 
if Pi ^ 0, let y , = Pi~ H = This choice of y { replaces pi 

by P/ = 1 • Hence our symmetric scalar product has a matrix of 
the form 

(33) diag {1,1, 1,0, •••,<>}. 

This form is applicable in particular if $ is an algebraically closed 
field. 

If (13) is any symmetric matrix, then (/3) can be used to define 
a symmetric scalar product in over 4>. The matrices of this 
scalar product constitute the cogredience class determined by (/3). 
Thus we see that, if $ is algebraically closed of characteristic 5 * 2, 
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then any symmetric matrix (ft in $> n is cogredient to a matrix of 
the form (33). Clearly (33) is completely determined by the 
rank of (ft. This implies the following 

Theorem 4. If <t> is algebraically closed of characteristic ^ 2, 
then any two symmetric matrices in are cogredient if and only if 
they have the same rank. 

We assume next that $ is the field of real numbers. Let the 
vectors Ui be arranged so that 

Pi > 0, i = 1, 2, • • •, p\ ft < 0, j = p + 1, * • •, r. 

Then we can extract a square root of ft, i < p and of — ft, 
r >j > p. Accordingly, let y *• = ft“ H , yj = ( — ft)“ M . Using 
these yi as above, we obtain a matrix 

, p — , - — r — p 

(34) {1, •••, 1, -1, •••, -1,0, ---,0} * 

for g{x y y). It follows from this that any real symmetric matrix 
is cogredient to a matrix of the form (34). We shall show next 
that two matrices of the form (34) cannot be cogredient in 4> n , 
<$> the real field, unless they are identical. This will follow from 

Theorem 5 (Sylvester). If the diagonal matrices 

{181,162, 

are cogredient in <£ n , $ the field of real numbers , then the number p 
of positive pi is the same as the number p' of positive ft'. 

Proof. The given matrices may be taken to be matrices of the 
same symmetric scalar product in 9? over4>. We may suppose that 
the first p Pi are > 0 and the first p r Pi are > 0 and that in both 
matrices the first r elements are 9 ^ 0 and the last n — r are 0. 
Let («j, u 2 , • • •, u n ) be a basis relative to which the matrix of 
g(x y y) is {Piy p 2y • • •, P n } and (v ly v 2 > • • •, v n ) a basis for which 
the matrix is {P\ y Pf, * * *, ft/}. It is easy to see (Ex. 2, p. 154) 
that the radical 

tR = \Ur- j-l> 2y *> ^n] = \Pr-\- ly ‘3 ^n]« 

* In the remainder of this chapter we drop the symbol “diag” in the notation for diag- 
onal matrices. 
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We now introduce the following subspaces: 



9^_j_ \j^\y *3 Mp\ 

©+ = [Vly v 2 , - y V P '] 



— [^p+lj ^p+2y *3 ^r] 

©_ = [frp'-f- 1 , Pp'_|_23 *3 ^r]« 

p n 

Let y e 9J+ + 5f x . Then y = ^2 »?.«*• + ^ ’Jyft an d 

1 r+l 

g(y> y) = Z Vi 2 Pi- 
1 

Since ft > 0 for * = 1, 2, •••,/>, £(y,.y) > 0 and £(y, y) = 0 
only if all the rn = 0 , i < p. Thus if y e 9?+ + 9? x , giy, y) > 0 
and g{y, y) = 0 only if y e 9t x . A similar result holds for © + 
+ 9? x . On the other hand, if jy e 9J_ + 9f x or to ©_ + 9i x , then 
a similar argument shows that giy, y) < 0 and that giy, y) — 0 
only if y e $K X . Now let y e (9i + + 3i x ) D (©_ + 9f x ). Then 
giy, y) ^ 0 and giy, y) < 0. Hence giy, y) — 0 and y e $R X . 
This establishes the following relation : 

(35) (3t+ + $R X ) fl (©_ + 3f x ) = 3l x . 

We now make use of the general dimensionality relation (Ex. 3, 

p. 28): 

dim (©i fl © 2 ) = dim ©1 + dim © 2 — dim (©1 + © 2 ) 

> dim ©1 + dim © 2 — n. 

Applying this to (35) we obtain 

n — r > p + (« — r) + (r — p r ) + (» — r) — n. 

Hence p — p' < 0. Similarly p' < p and so p = p'. 

Sylvester’s theorem shows that the number of positive elements 
in any diagonal matrix that is cogredient to a given real sym- 
metric matrix (/3) is an invariant. Likewise the difference 2 p — r 
between the number of positive elements and the number of neg- 
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ative elements in any diagonal form is an invariant. We call this 
number the signature ( inertial index) of the matrix. The main 
theorem on real symmetric matrices may now be stated as 

Theorem 6. Two real symmetric matrices are cogredient if and 
only if they have the same rank and the same signature. 

We assume next that $ is the field of complex numbers and that 
a — > a is the mapping of a complex number into its complex 
conjugate. Let g(x } y) be a hermitian scalar product in 91 over $ 
associated with the usual mapping a — > a. For any u, g(u, u) = 
g{u y u) is real. In particular, if (u 1} u 2 , • • •, u n ) is a basis such 
that (31) holds, then the elements ft are real. If we replace «,• 
by Vi = yiUi, ji ^ 0, then ft is replaced by yft-ft = | | 2 ft. It 

follows in this case, too, that 91 has a basis relative to which the 
matrix has the form (34). 

Sylvester’s theorem holds in the present case also. It asserts 
that, if<£ is the complex field and {ft, ft, • • •, ft} and {ft', ft', 
• • ft 7 } are cogredient in the sense that there exists a matrix 
(jj.) in L(3>, n) such that 

ft ', • • •> ft 7 } = ( ft { ft , ft , • • •, ft } (/0 7 , 

then the number of positive ft is the same as the number of posi- 
tive ft 7 . The proof is exactly the same as before. The essential 
point is that, if 

V n 

jv = X) v iUi + X vjUj 

i r+1 

and ft > 0 for i < p and ft = 0 for j > r, then 
g(y,y) = 2 1 77 * | 2 ft > 0. 

As in the real case we have the criterion that two hermitian 
matrices are cogredient relative to the anti-automorphism a — > a 
if and only if they have the same rank and the same signature. 
Here again we define the signature to be the difference between 
the number of positive elements and the number of negative ele- 
ments in a diagonal matrix cogredient to the given hermitian 
matrix. 
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We consider finally the theory of quaternionic hermitian forms. 
Let A be the division ring of Hamilton's quaternions and let 
a —> a be the mapping of a quaternion a = a 0 + <*\i + oc 2 j + 
a z k into its conjugate 

a — qlq — a\i — a 2 j — ol %k. 

If g( x > J ) is hermitian in over A, g{u y u) = g{u y u) is real. 
Also aa = a 0 2 + a 2 + a 2 + a 3 2 > 0 and equality holds only if 
a = 0. These remarks indicate that this case is essentially the 
same as the complex hermitian case. Matrices of the form (34) 
serve as canonical forms under cogredience. Sylvester’s theorem 
and the final result that equality of ranks and of signatures are 
necessary and sufficient conditions for cogredience hold. 



EXERCISES 

1. Prove that the number of cogredience classes of real symmetric matrices of 
n rows and columns is ( n + 1 ){n + 2)/2. 

2. Find a matrix of the form (34) cogredient to the real symmetric matrix 

- 1 3-5- 

3-1 0 • 

.-5 0 2. 

3. Find the matrix of the form (34) that is cogredient to the hermitian quater- 
nion matrix 

•-1 1 + / — 2j -2 i + k' 

1 - i + 2j 0 4 j 

-2 i - k -4 2 

4. Prove that, if (a), (/3), •••,(?) are a) real symmetric, or b) complex hermitian, 
or c) quaternionic hermitian, then 



M 2 + (0) 2 +--*+W 2 = o 



can hold only if (a) = (J3) = • • • = (v) = 0. 

10. Alternate scalar products. A scalar product g(x 3 y) is called 
skew symmetric if g(x y y) = —g(y y x) for all x and y. In this 
case g(x y x ) = — g(x, x). If the characteristic is not two, then 
this implies that g(x y x) = 0. If the characteristic is two, it 
may still be true that g(x y x) = 0 for all x. We shall now call 
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a skew symmetric scalar product alternate if g( x y x) — 0 for all 
x y and we consider the problem of finding suitable canonical 
bases for this type of scalar product. The proof of the lemma on 
p. 152 shows that, if g(x y y) is alternate, then it is skew symmetric 
and that, if g ?*■ 0 is skew-symmetric, then a = a and A = 
is commutative. 

Suppose that g(x y y) is alternate and not identically 0. Then 
we can find a pair of vectors u y v such that g{u y v) 9 ^ 0. By re- 
placing v by a suitable multiple Vi of v we then obtain u x = u 
and Vx such that g(u ly Pi) = 1. Then g(v 1 , # 1 ) = — 1, g(u ly u x ) 
= 0 = g(v iy Vi). It follows that U\ and Vi are linearly independ- 
ent. Now suppose that we have already found k pairs of vectors 
Ui y v iy u 2y v 2y • • *, u ky vjc that are linearly independent and satisfy 

(36) g(u iy Vi) = 1, g(v iy Ui) = -1 

with all other products 0. Let E k denote the linear transforma- 
tion 

k k 

(37) X — > ^2g(x 3 Vi)Ui — 

1 1 

As in the proof of Theorem 3 we see that E k is a projection on 
the space ©* = [u ly v ly u 2y v 2y • • •, u ky v k ]. Hence if F k = 1 — E ky 
Also we can verify that g{xE ky ul) = g(x y ui) 
and g(xE ky v z ) = g{x y v t ). Hence g(xF ky u t ) = 0 = g(xF ky v t ) and 
dtF k c ©zb -1 -* Now either g is 0 in 9tF& or we can choose a pair 
of vectors u k+u v k + 1 in this space such that 



g(M k+ 1 , v k +i) = 1 = -g(v k+ 1 , «* + i). 



Since ©* fl = 0, ( u ly v iy u 2y v 2y • • •, «jb+i,»*+i) is an independ- 
ent set and since the last two vectors are orthogonal to the pre- 
ceding ones the set of 2{k + 1) vectors satisfies the same condi- 
tions as the set (u iy Vi y u 2y v 2y •••, u ky v k ). Eventually we either 
span the whole space or obtain a space 9iF r in which g is 0. In 
the latter case we choose any basis (z 2r+ 1 , • • *, z n ) for this space. 

* ©k 1 is the orthogonal complement of ©*, defined as for hermitian scalar products. 
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Evidently the matrix of g(x y y ) relative to the basis 



is 



(U\yV\y U2y02y ' y U Ty V T y %l>%2y * y %n — 2 r) 



0 1 

-1 0 






0 1 

-1 0 



r blocks 



(38) 



0 1 

-1 0 



0 



l 01 

This proves the following 

Theorem 7. If g(x,y) is an alternate scalar product, there exists 
a basis for 9i relative to which the matrix has the form (38). 

If g(x, y) 7 * 0 is alternate, the anti-automorphism is the identity 
and the matrices (/3) of g(x, y) are alternate in the sense that 
(f3)' = — (/3) and /?,■» = 0 for i — 1, 2, •••,«. These conditions 
are also sufficient; for if .v = then 

g(x, #) = 2 PifA) = 2 Pi&f + E (Pi) + Pjd^i^j = 0. 

ij * i <3 

Hence g(x y y ) is alternate. The following results are now easy 
consequences of Theorem 7. 

Corollary 1. The rank of an alternate matrix with elements in a 
field is even. 

Corollary 2. Two alternate matrices in <£ n are cogredient if and 
only if they have the same rank. 
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1. Prove that 



EXERCISE 



- 0 


2 


-1 


3 - 




' 0 


1 


0 


0- 


-2 


0 


4 


-2 


• 03 ) = 


-1 


0 


0 


0 


1 


-4 


0 


1 


0 


0 


0 


1 


.-3 


2 


-1 


0. 




. 0 


0 


-1 


0. 



are cogredient in <£4, $ the field of rational numbers. Also find a (jjl ) in L($, 4 ) 
such that (/ 3 ) = Qx) (a:) (/*)'. 



*11. Witt’s theorem. We now take up again the general 
theory of hermitian scalar products over a division ring. In the 
present discussion we shall assume that the basic anti-automor- 
phism a — > a satisfies the following solvability condition. 



Axiom S. The equation £ + £ = $ has a solution for every her- 
mitian element (3 of the division ring. 



This axiom is automatically satisfied if the characteristic of A 
is not two; for, in this case, we may take £ = It is also satis- 
fied in the characteristic two case if there exists an element 7 
of the center T of A such that 7^7. Then ^ = 7 + 7^0 is 
in T since the anti-automorphism maps T into itself. Hence if 
£ = / 3 76 -1 , then 

£ + £ = 13 7 S - 1 + Pyd- 1 = Pd-\y + 7 ) = P. 



We remark finally that our axiom rules out the case A = <£, a 
field of characteristic two, a = a. This is clear since in this case 
£ + £ = 0 while 1 6 need not be 0. 

Assume that dt is a vector space over A and that g(x , y) is a 
non-degenerate hermitian scalar product relative to the anti-auto- 
morphism a — > a. We recall that, if © is a subspace of dt that is not 
isotropic in the sense that © f I © x = 0, then we have the de- 
composition dt = © © ( S> ± . The basic result of the theory which 
we shall develop is the following theorem. 



Witt’s theorem. If ©1 and © 2 are non-isotropic and g-equiva- 
lent y then ©i* 1 and ©2* 1 are g-equivalent* 

* Witt proved this result for symmetric scalar products over a field of characteristic 
j* 2 {Journal fur Math Vol. 176 (1937)). The extension to division rings of characteristic 
t* 5 2 is due to Pall {Bulletin Amer. Math. Soc., Vol. 51 (1945)). In the present discussion 
we assume only the foregoing Axiom S. Cf. also Kaplansky, Forms in infinite dimensional 
spaces , Anais Acad. Brasil Ci. 22 (1950), pp. 1-7. Witt’s theorem does not hold for sym- 
metric scalar products over a field of characteristic 2 (see the next section). 
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Proof. It suffices to prove the result for dim © x = 1; for, if it 
is known in this case, then we can use induction on dim © x as 
follows. We choose a vector U\ in © x such that [ui] is not iso- 
tropic (Lemma to Theorem 3). Then © x = [«i]®U x where 
U x Q [ui\ ± . Using the equivalence of © x and © 2 , we can write 
©2 = [« 2 ] © U 2 where U 2 c [wj- 1 , [« x ] and [« 2 ] are equivalent and 
Ui and U 2 are equivalent. Then [ui] 1 - = U x © ©^ and [u 2 ]' L = 
U 2 © © 2 J " are equivalent under a transformation £/. Hence 
[« 2 ] x = U 2 © © 2 X = U X U® ©i^U. Now U 2 and U X U are equiva- 
lent and U x £7 and © x ± U are orthogonal. Since dimU x ?7 < 
dim ©i and U x is not isotropic, we can assume that © X ± C7 and © 2 ± 
are equivalent. This implies that © x x and © 2 X are equivalent. 

We note next that the case: © x = [« x ], dim arbitrary, can 
be deduced from the special case: © x = [« x ], dim dt = 2. Thus, 
suppose we know the result in the special case. Let © 2 = [u 2 ] 
be equivalent to [« x ]. We have the decompositions 9 1 — [« x ] © 
[«i] x = [« 2 ] ® [« 2 ] x and g{u u u x ) = g(u 2 , u 2 ). If [«i, « 2 ] is one- 
dimensional, [«i] x = [a 2 ] x , and the theorem holds. Hence as- 
sume that dim [ux, « 2 ] = 2. Consider first the case in which 
this space is not isotropic. Here 

9? = [«i, « 2 ] ® \u\, « 2 ] x 

(39) = [i ux ] ® ([«i, u 2 ] D [«i] x ) © [u 1} « 2 ] x 

= [«2] ® ([«i, «2] n [« 2 ] x ) © [«i, a 2 ] x . 

Then [« l5 « 2 ] H [«j] x and [« l5 « 2 ] H [« 2 ] x are equivalent by the 
case dim 9f = 2 (applied to 91 = [u 1} « 2 ]). Hence by (39) 

[«i]' L = ([«ij «2] n [«i] x ) © [«i, « 2 ] x 

and 

[« 2 ] x = ([«i, « 2 ] D [m 2 ] x ) © [«i, « 2 ] x 

are ^-equivalent. Next consider the case in which [« 1} « 2 ] is iso- 
tropic. Here we have a vector w 9 * 0 in this space such that 
g(w, Ux) = 0 = g(w, u 2 ). We can find a / in 9? such that g(w, t) 
9^ 0. Then the matrix of g in \u x , u 2 , t] relative to the basis 
(w, u 1} t) is 

'0 0 g{w,t)~ 

0 g{Ux, Ux) * 

■git, w) * * 
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It is evident that the row vectors of this matrix are linearly 
independent; hence the matrix is non-singular and [u iy u 2y /] is 
not isotropic. Hence 

dt = [u ly u 2y /] ® [u u u 2y t} 1 - 

= [«J ® ([«i, u 2y t\ n [aiH ® [u ly u 2y t] x 

= [«2] ® ([«1, U 2y t) n [u^) © \u ly U 2y t] L . 



Thus, it suffices to show that [u ly u 2y t ] fl [u^\ L and [u ly u 2y /] fl 
[u^[ L are equivalent. Now these are non-isotropic two-dimen- 
sional spaces that contain an isotropic one-dimensional subspace 
[w]. In a space of this type we can always select a basis relative 



to which the matrix is 



0 1 ' 

.1 0 . 



To see this, choose q so that 



g(w y q) = 1 = g(q y w) and set z = q + \w. Then g(z y w) — 1 
= g(w y z) also while g(z y z) = g(q y q) + X + X. By Axiom S we 
can choose X so that g(z y z) = 0. This gives a matrix of the re- 
quired form. It follows now that [u ly u 2y /] fl [ui] 1 - and [u Xy u 2y /] 
D [u 2 ] l are equivalent. This completes the reduction to the 
case: dim = 2, dim ©i = 1. 

We consider finally this special case. The result we wish to 
prove here is equivalent to the statement that, if the non-singu- 
lar diagonal hermitian matrices {a y Pi} and {a y fi 2 } are cogredient, 
then the elements /?i and /3 2 are cogredient. Let (fx) be a non- 
singular matrix such that 



Mil M12" 




'a 0 




Mil M21 




at 0 " 


-M21 M22- 




-0 fa . 




M12 M22- 




-0 fa . 



If A is commutative, we take determinants and we obtain 0 2 
= n/3 1/2 where /x = det (/x r -y), which is the desired result. In the 
general case we shall work directly with the conditions that are 
given by the matrix relation. These are 

(41) Mn<*M li + M 12 &M 12 = a 

(42) Mll«M2 1 + M I2pl&22 = 0 = jU21«MU + M220lMl2 

(43) M21«M21 + M22^lM22 — &2- 

If /tn = 0, fi 22 = 0 by (42) since M 12 t 6 0. Hence a = n 12 P 1 V 12 
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and 62 = M2i«M2i* These relations imply that 61 and /? 2 are co- 
gredient. 

Assume, therefore, that mii ^ 0. Then by (42) 

<*M21 = — Mil” Vl2$lM22 
M21 = — M220iMi2Mu -1 a -1 

so that by (43) 

M22(^lMl2Mll — 1 «~ 1 Mll — 2 Ml2^1 + /3 i)m 22 = /?2- 

Thus 62 is cogredient to /3iMi2Mu -1 a -1 Mii -1 Mi2/3i + j8i. We 
wish to show that the latter element is cogredient to j8 x . For 
this purpose we try to solve 

(44) (1 + /3iMi 2 &ii 2 )/3i(l + M12IM12/31) 

= |3iMi2Mii -1 « -1 Mii -1 Mi20i + 0i* 

This will be satisfied if 

(45) £ + £ + ?Mi2/5iMi 2 | = (mii«Mii) - 1 - 
Replacing Mi2j8iMi2 according to (41) we obtain 

(46) £ + £ + £(« — MuaMu)l = (MuaMii) -1 . 

If Mu — lj (46) reduces to £ + £ = (mh«Mh) -1 , which is solva- 
ble by Axiom S. If mu ^ 1, we make the substitution £ = ij -1 
and multiply on the left by t\ and on the right by r\ to obtain 

(47) V + V + (a ~ Mii^Mu) = »?(mii«Mii) -1 ^- 
Next we substitute ij = f + Mii«Mn and obtain 

(48) t(Mii«Mu) -1 r = «• 

Now (48) is satisfied by f = — a/x n ; hence (47) is satisfied by 
V = (mu — l)«Mn which is not 0. Then £ = ij -1 satisfies (46). 
Thus (44) holds and /3i and /3 2 are cogredient. This completes 
the proof. 

A 1-1 linear transformation U of SR onto itself is said to be 
g-unitary if g(xU,yU) = g(x,y) holds for every pair of vectors 
x, y in Evidently this condition is equivalent to the require- 
ment that UU' = 1 where U' is the transpose of U relative to the 
scalar product. Now suppose again that and @ 2 are non-iso- 
tropic spaces which are ^-equivalent and let M be a ^-equivalence 
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of ©i onto © 2 - By Witt’s theorem we can find an equivalence 
TV of ©I -1 onto © 2 " l . Since dt = ©i ® ©i 1 *, any vector x can be 
written in one and only one way as u + v where u e ©i and 
v e ©i -1 . Then the mapping U: x — > uM + vN is a 1-1 linear 
transformation of $R onto itself. Moreover, making use of the 
fact that uM e © 2 and vN 8 © 2 " L , we can verify directly that U is 
^-unitary. Evidently U coincides with the given equivalence M 
on the subspace ©i. Thus we see that Witt’s theorem implies 
that any ^-equivalence between non-isotropic subspaces can be 
extended to a ^-unitary transformation. We shall now show that 
this result holds also for isotropic subspaces of SR. 

Consider first an arbitrary subspace © of SR. If x is any vec- 
tor in SR, then the mapping y — > g x (j) = g(y, x) of © into A is 
a linear function. It is easy to see that the linear functions thus 
obtained fill up the conjugate space ©* of © (Ex. 2, p. 56). 
Hence if (ju y 2y • • •, y m ) is any basis for ©, then we can find a 
vector Vi such that 

(49) g(y ly 01 ) = 1, g(y iy »i) =0 for i > 1. 

Assume now that © is isotropic and that (y ly y 2 , • • *, y v ) is a 
basis for the radical of ©. Then we can choose the vector so 
that in addition to (49) we have g(v ly v^ = 0. This can be seen 
by an argument used in the proof of Witt’s theorem. Thus if 
Vi is not orthogonal to itself, then we can replace this vector by 
z>i + Xy 1 and choose X so that X + X + ^(^ 1 , z>i) = 0. We then 
denote this new vector as Vi. 

Now the space [y 1 , Vx] is a two-dimensional non-isotropic sub- 
space of $R. Hence 9 1 = [y 1 , Vi] © [y 1 , yi] 1 -. Also it is clear that 
\j 2 y • • •, ym] £ \ji, Vi] 1 - and that [y 2 , • • •, y v ] is the radical of 
[y 2y • • *,y m ]- Hence we can use induction on v to prove the exist- 
ence of a set of vectors {v u v 2y • • *, v v ) such that 

g(jjy Vj) = j = 1,2, 

(50) g(ji y vj ) = 0 otherwise 

g(P h v k) =0 j y k = 1, 2, • ■ v. 

It is immediate that the vectors (z>i, v 2y • • •, v v ) are linearly inde- 
pendent and that the space 93 = [v ly v 2y • • *, v v ] is totally isotropic 
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and satisfies 25 fl © = 0. The matrix of g in © + 25 relative to 



the basis (y u y 2 , "‘,ym, »i, 


V 2 , " 


, tv) is 








1 




0 


0 


1 


(51) 


0 


B 


0 




1 


0 


0 




1 







where B is the matrix relative to (jv+u •••> jVm). Since [y i} 
• • - is the radical of ©, B is non-singular. Hence (51) is non- 
singular and so © + 25 is not isotropic. 

Now let U be an equivalence of © (onto ©17). Evidently 
[y-JJ, • y r U] is the radical of ©17. Hence we can find a set 
of vectors (yi, v 2 , • • • , v v ) such that 

g(jjU, Vj) = 1, j = 1, 2, • • •, v 

g(jiU, Vj) = 0 otherwise 

g(0j, v k ) =0, j, k = 1 , 2, • • •, V. 

Then it is clear that the linear transformation that sends vj into 
vj and that coincides with 17 on © is an equivalence of © + 25. 
Since © + 25 is not isotropic, this mapping can be extended to a 
^•-unitary transformation. Hence we have proved the following 

Theorem 8. Any g-equivalence of a subspace of 91 can be ex- 
tended to a g-unitary transformation in 91.* 

A hermitian scalar product is called totally regular if g(x, x) 
9 ^ 0 for every x 0 in 91. This is equivalent to saying that 
every non-zero subspace of 91 is not isotropic. Hence if g is 

* Cf. Dieudonn6, Sur les Groupes Classiques , p. 18. 
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totally regular and © is any subspace, then 9? = ©©©L We 
shall now show that the problem of cogredience of hermitian 
matrices can be reduced to the case in which the associated scalar 
products are totally regular. 

Let © be a totally isotropic subspace of 9? which is maximal 
in the sense that it cannot be imbedded in a larger subspace of 
this type. Write © = [y u • • •, y v \ where the are linearly in- 
dependent. As before we determine a totally isotropic space 
93 = bi, • • •, v p ] such that £ = © + 93 = ©®93 and such that 
the matrix of g relative to the basis (y i, • • - , y vy v ly • • •, v v ) of 
Xis 





1 


0 


• 




1 


1 






0 


1 





Since © is a maximal totally isotropic subspace and 9? = © 0 930 
X ± y g is totally regular in X ± - We can choose a basis for 9? so that 
the matrix of g has the form 



( 53 ) 



0 


1 

1 


0 


1 






\ 


0 


0 


1 






0 


0 


B 
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where B is a matrix of g in Thus any hermitian matrix is co- 
gredient to one of the form (53) in which B is totally regular 
(that is, the associated scalar product is totally regular). 

Conversely suppose that we have any matrix of g of the form 
(53) where B is totally regular. Let (j ly • • •, y vy v u *•*, v v , 
Ziy • • •, 2 n _ 2 v) be a basis such that the matrix of g relative to 
this basis is the given matrix (53). Then © = [y u • • •, y v ] is 
totally isotropic. Moreover, © is maximal totally isotropic; for, 
if there is a larger totally isotropic subspace containing ©, then 
it contains a non-zero vector of the form v + z where v e [v ly 
• • •, v y ] and ze[z u • •, z n - 2 J. Then g(v + z, v + z) = g(z> z ) 
= 0 so that z = 0 . If v = 27 iViy g{Vyji) = 7 * Hence also 
v = 0 contrary to v + z 5 * 0 . 

We are now in a position to show that two matrices of the form 
(53) in which the B’s are totally regular are cogredient if and 
only if the matrices B are cogredient. The “if” part is, of course, 
trivial, and the foregoing discussion shows that the “only if” 
part is equivalent to the statement that the spaces X " 1 deter- 
mined as above by a maximal totally isotropic subspace © are 
^-equivalent. 

We observe first that any two maximal totally isotropic sub- 
spaces have the same dimensionality. Thus let ©1 and © 2 be 
of this type and assume that dim ©1 > dim © 2 . Then we can 
find a subspace Ui of ©1 such that dim Ui = dim © 2 . Since Ui 
and © 2 are totally isotropic, any 1-1 linear transformation of 
Ui onto © 2 is a ^--equivalence; hence it can be extended to a uni- 
tary transformation U. Then © 1 U is totally isotropic and con- 
tains UiU = © 2 . By the maximality of © 2 this implies that Ui 
= ©1 and that dim © x = dim © 2 . 

Now let ©*• = [yi (i) , • • •, y v (i) ]j i = 1, 2 , and determine 33* = 
bi (i) j * * *3 tv (<) ] as above so that the matrix of g in 3 U = ©* + 33 * 
is (52). Then £1 and £ 2 are ^-equivalent; hence also Xi 1 - and X 2 ± 
are ^-equivalent. We have therefore established the following 

Theorem 9. Any non-singular hermitian matrix is cogredient 
to one of the form (53) in which B is totally regular^ two matrices 
of the form (53) in which B is totally regular are cogredient if and 
only if the submatrices B are cogredient . 
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The number of rows and columns of the matrix B is, of course, 
an invariant of the cogredience class. This number is the di- 
mensionality of the spaces X -1 and is also n — 2v where v is the 
maximum dimensionality of totally isotropic subspaces relative 
to g. We shall call this non-negative integer the Witt signature 
of g or of the associated matrices. It can be seen (Ex. 1 below) 
that for symmetric matrices over the real field or for hermitian 
matrices over the complex field or over real quaternions the Witt 
signature is the absolute value of the ordinary signature which 
we defined before. 

EXERCISES 

1. Prove the foregoing statement on signatures. 

2. If g is an alternate non-degenerate scalar product, a 1-1 linear transforma- 
tion S of such that £(* <9, jyS) = g(x>y) is called a symplectic transformation. 
Prove the following analogue of Theorem 8: Any g-equivalence of a subspace of 
9? can be extended to a symplectic transformation. 

*12. Non-alternate skew-symmetric forms. We suppose finally 
that g(x, y ) is a skew symmetric scalar product that is not alter- 
nate. Then <l> has characteristic two and g(x> y) may also be re- 
garded as symmetric. Moreover, we have seen that, if (e u e 2 , 
• • •, e n ) is a basis for 9t, then g(e ef) 0 for some /. We shall 
now show that it is possible to choose a basis (#i, u 2j • • u ry 
Zi, z 2 > * * * ? %n-r) for such that the matrix determined by this 

hacic i o 

0 ; 0 . Evidently U\ can be chosen so that («i, «i) = |3i 5 ^ 0. 

Now suppose that «i, u 2 , • • •, u k have already been found such 
that g(u { , Uj) — SijPi, 7 ^ 0. As in the proof of Theorem 3 we 
can write 9 ? = ©*© where ©* = [u\, u 2 , • • •, «*] and StF* 
C ©jA. If g(x, y ) is identically 0 in 9 t?% we set k — r and choose 
a basis (zj, z 2 , ■ ■ - , z„_ r ) in $lF k . If g(x, y) is not alternate in 
StF/c, then we choose a vector u k+ 1 in this subspace so that 
g(uk+i, Mfc+i) = Pk + 1 7 ^ 0 . We then repeat the argument with 
the k + 1 us. It remains to consider now the case in which 
g(x, y) is not identically 0 and alternate in DbE*. Here we can 
find two linearly independent vectors v, w such that 

(v, v) = 0 — ( w, w), ( v , w) = 1 = ( w , v ). 
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We now set u = u k , P = p k and we consider the scalar product 
in the three-dimensional space [ u , v, w], Using these vectors as 
basis we obtain the matrix 

/3 0 0 

0 0 1- 

.0 1 0. 

Thus if y = £« + rjv + £w and y' = %u + rj'v + f'w, then g(y,y') 
= /3££' + Hence the following vectors 

yi = « + v 

y 2 = u + Pw 

y 3 = u + v + Pw 

are orthogonal in pairs and satisfy g(yi, j») = P- It follows that, 
if we replace the original tii- b y y i and call this vector Ui- again, 
then u u u 2 , •••, u k , u k+1 = y 2> u k+2 = y 3 satisfy g(u h Uj) = 
SijPiy Pi ^ 0. This proves 

Theorem 10. If $ has characteristic two and g(x, y) is a non- 
alternate symmetric scalar product in 9? over i>, then there exists a 
basis of relative to which the matrix of g{x, y) is a diagonal 
matrix .* 

The argument used in the proof of the preceding theorem shows 
that the matrices. 

T 0 0] ri 0 o- 

0 0 1 . 0 10 

.0 1 OJ 10 0 1. 

are cogredient. On the other hand these submatrices 

0 1] ri 0 

1 0J 10 1 

are not cogredient. These observations show that Witt’s theo- 
rem does not hold in the characteristic two case. 

* The canonical form given in this theorem is useful in that it simplifies matrix calcula- 
tions with symmetric matrices over a field of characteristic two. A more geometric dis- 
cussion of symmetric scalar products in the characteristic two case has been indicated by 
Dieudonn6, Sur les Groupes Classiques i p. 62. 
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EUCLIDEAN AND UNITARY SPACES 



As we have pointed out at the beginning of these Lectures, 
Euclidean geometry is concerned with the study of a real vector 

n 

space relative to the scalar product ^ £ 1 * 7 » determined by writ- 

1 

ing x = iUiy y = ^ErjiUi in terms of basic unit vectors that are 
mutually orthogonal. Since this scalar product is fixed, it is cus*- 
tomary to denote it simply as (x> y ) instead of g(x, y) as in the 
preceding chapter. The geometric meaning of (x> y) is clear. It 
gives the product of the cosine of the angle between x and y by 
the lengths of the two vectors. The length of x can also be ex- 
pressed in terms of the scalar product, namely, | x | = (x> x) 1/2 . 

From our point of view the characteristic properties of (, x>y ) 
can be stated by saying that this function is positive definite in 
the sense that (x, x) > 0 for all x ^ 0. In fact, it is customary 
nowadays to axiomatize Euclidean geometry in the following 
way. We suppose SR is a finite dimensional vector space over the 
field of real numbers, and we take in SR a positive definite sym- 
metric scalar product (x y y). The space SR over <£, together with 
the fundamental scalar product, constitutes a Euclidean space . In 
a similar manner we define the complex analogue of a Euclidean 
space as a vector space over the field of complex numbers to- 
gether with a positive definite hermitian scalar product defined 
in this space. In this chapter we shall study properties of these 
spaces and of certain special types of linear transformations in 
these spaces. We shall also consider briefly the theory of analytic 
functions of matrices (or linear transformations). 
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1. Cartesian bases. Let 91 be a Euclidean space in the sense 
that 9 1 is an ^-dimensional vector space over the field 3> of real 
numbers in which a positive definite symmetric scalar product 
{x y y) is defined. Thus the basic properties of this real valued 
function are 



(1) 


(*1 + * 2 , y) 


= (#i,j y) 


+ (x2,y), 




(x,yi + y 2 ) 


= foji) 


+ (x, y 2 ) 


(2) 


(ax, y) 


= «(*> J>0 


= (x, qy) 


(3) 


(*>y) 


= 0, *) 




(4) 


(x, x) 


>0 if 


a; 7 ^ 0 . 



Now we know that, if (x y y ) is any symmetric scalar product in 
a real vector space, then there exists a basis (u ly u 2y •••, u n ) 
such that ( Ui y Uj) = where the Pi are either 1, —1, or 0. If 
(x y y) is positive definite, clearly every Pi = 1. Thus we see that 
there exists a basis (u iy u 2y * • *, u n ) for 9 ? such that 

(5) {u iy uj) = dij . 

Then if x = 2£ *•«*, y = 2 ruu », 

(X>y) == 

as usual. 

A basis which satisfies (5) will now be called a Cartesian basis 
for the Euclidean space. The method of Lagrange for determin- 
ing such a basis is capable of some refinements which we now in- 
dicate. Let (e ly e 2y • • • , e n ) be any basis for 91. Then (ei y ex) > 0. 
Hence if u x — (e ly ex)~^e ly ( u iy Ux) = 1. We next apply La- 
grange’s reduction and set 



f 2 (^2j U\)U\. 



Then (/ 2 , u x ) = ( e 2y u{) — ( e 2y ux)(u ly u x ) = 0. Hence Ui and 
u 2 = (/ 2 ,/ 2)“^/2 satisfy 



(«1, «l) = 1 = (^2, ^ 2 ), («1, « 2 ) = 0 = («2, U X ). 
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Suppose now that (u u u 2y •••, Uk) have already been deter- 
mined so that (uiy Uj) = 8ij and [u u u 2y • • *, «»] = [ei, e 2y • • *, 
for i < k. Let 



fk- J-l = e k + 1 “ (** + !> #l)#l ■” (th+ u U 2 )u 2 — • • • — (tffc + i, Uk)Uk* 



Then (fk+i y «») = 0 for / = 1, 2, • • *, k. Hence if we set &&+i 
= (fk+u fk+i)~ V2 fk+u then (u u u 2 , • •, « & +i) satisfy the condi- 
tion stated for («i, u 2y • • - , «*). Repetition of this method leads 
at length to a Cartesian basis for 9?. The process we have given 
for “orthogonalizing” the basis (ei, e 2y •••, e n ) is often called 
E. Schmidt* s orthogonalization process . 

It is worth noting that the matrix of (#i, u 2y • • •, u n ) relative 
to (e u e 2y •••, * n ) is triangular; for [«i, —•> u i\ = [*i> e 2y 

• • •, d'J. Hence is a linear combination of the ej with j < /. 

i 

Thus Ui = Tijejy and the matrix of the us relative to the *’s is 
,y=i 



( 6 ) 



w 



>11 

T21 T22 



01 



tT n i T n 2 ' * * T nn J 



If (j8) is the matrix of (#, jy) relative to the originally chosen 
basis (eiy e 2y • • •, ^ n ), then we know that the matrix of (#i, u 2y 
• • •, « n ) is (t)(/3)(t)' . On the other hand, by (5) the latter matrix 
is the identity. Hence (r)(j8)(r)' = 1. The inverse (?) of (r) is 
also triangular of the same form as (r) and (0) = (?)(?)'• This 
proves the following 



Theorem 1 . Let (j 8) be a matrix of a positive definite symmetric 
scalar product . Then there exists a triangular matrix (y) such that 

08) = MM'. 



We consider now the relation between Cartesian bases and 
matrices. Let (vi 3 v 2y • • • , v n ) be a definite Cartesian basis. Then 
if (uiy u 2 y • • •, u n ) is a second such basis, the matrices of (*, y) 
relative to these bases is 1. Hence if (<r) is the matrix of («i, u 2i 
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• • *, u n ) relative to (vi> v 2y • • v n ) y then (cr)(<r)' = 1. A matrix 
satisfying this condition: 

(7) WW' = l 

is called an orthogonal matrix. Conversely, if (<r) is any orthogonal 
matrix and Ui = j y then the matrix of ( x , y) relative to 
(u ly u 2y • • *, u n ) is (< 7)1 (o’) 7 = 1. Hence the us form a Cartesian 
basis. Thus we see that, if one Cartesian basis is known, all 
others can be obtained from it by applying orthogonal matrices. 
We have a 1-1 correspondence between the Cartesian bases and 
the orthogonal matrices. 

If ( 0 ) is orthogonal, then (det (or)) 2 = 1. Hence det ( <r ) = ±1. 
If det (<r) = 1, we shall say that (a) is proper , otherwise (<r) is 
improper . It is readily verified that the orthogonal matrices con- 
stitute a subgroup 0(^>, n) of the group L($>, n). The proper or- 
thogonal matrices form an invariant subgroup 0i($>, n) in 0(3>, n) 
and the index of Oi in 0 is two. 

If two Cartesian bases are related by a proper orthogonal ma- 
trix, then we say that the bases have the same orientation and, 
if the matrix relating the bases is improper, we say that the bases 
have opposite orientation . Thus the Cartesian bases fall into two 
mutually exclusive classes, namely, those that have the same 
orientation and those that have opposite orientation to a par- 
ticular Cartesian basis (v iy v 2y • •, v n ). We note that a change 
of sign of one vector or an odd permutation of the vectors changes 
the orientation. 

Suppose again that (e ly e 2y • • •, e n ) is any basis and let (7) be 
the matrix of this basis relative to the Cartesian basis (v iy v 2y 

• • *, v n ). Since the ^s form an arbitrary basis, the matrix (7) is 
an arbitrary matrix in L(<I>, n). Now we have seen that there 
exists a Cartesian basis (# 1 , u 2y • • •, « n ) whose matrix relative to 
(e u e 2y • * •, e n ) is a triangular matrix (r). The matrix of (u u u 2y 

• • •, u n ) relative to (u ly v 2y • • *, v n ) is the product (r)(y). Since 
(«i, u 2y • • *, u n ) is Cartesian, (r)(7) = (a) is orthogonal. Hence 
we have the following 

Theorem 2. If (7) e Z,($, n) y $ the field of real numbers y then 
(7) may be factored as (v) (a) where {y) is triangular and (a) is 
orthogonal . 
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EXERCISES 

1 . Show that the matrix (v) in the preceding theorem can be taken to have 
positive diagonal elements. Show that, if this normalization is made, then ( v ) and 
(a) are uniquely determined by (7). 

2 . Decompose 

f — 1 0 2 -2' 

3 4-1 7 

12 1-1 
.0 2 0 1. 

as a product of a triangular matrix and an orthogonal matrix. 

3 . Show that, if (/ 3 ) is the matrix of a positive definite symmetric scalar 
product, then det (J 3 ) > 0. Generalize this to prove that every diagonal minor 
of (/ 3 ) is positive. 

4 . Prove that, if ( x,y ) is a positive definite symmetric scalar product, then {u y v ) 2 
< («, u)(v , v) for any u and v and that equality holds only if u and v are linearly 
dependent. 

5 . Prove the triangle inequality for lengths of vectors: 

(u + v y u + v ) H < («, u ) M + (v } v ) 

2. Linear transformations and scalar products. If A is a 

linear transformation of 9? over <£ into itself, then g{x y y) = 
(xA y y) is a second scalar product in the space. Conversely let 
g(x> y) be any scalar product in 9J. Then if we hold x fixed, the 
function g(x, y) is linear in y. Hence, as we have shown in Chap- 
ter V, there is a uniquely determined vector xA in 9? such that 
g(x y y) = ( xA y y ) for all y. The mapping A is linear. This 
shows that any scalar product in 9t can be obtained from the fun- 
damental scalar product ( x,y ) by applying a linear transformation 
A in the above manner. The theory of scalar products in 9J is 
therefore equivalent to the theory of linear transformations in 9J. 

For the most part we shall adopt the linear transformation 
point of view. We recall first the definition of the transpose A' 
relative to ( x y y) of the linear transformation A: A' is the linear 
transformation in 9? which satisfies the condition 

(8) {x y yA') = (xA y y) 

for all x y y in 9 1- There is only one linear transformation that 
satisfies this condition, and the requirement (8) can be reduced 
to the n 2 equations 

ujA') = (uiA y Uj) y i y j = 1 , 2, • • •, n 
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for a basis («i, u 2y • • *, « n )- If the basis is Cartesian these con- 
ditions imply that A f is the linear transformation whose matrix 
relative to ( u ly u 2y • • *, u n ) is the transpose of that of A . If 



then 



= HaaUic and UjA ’ = XfijiUi, 

{iii y UjA ) Ui) (3ji 

(ujAy Uj) = = oL%j, 



Hence 03) = (a)'. (Cf. p. 145.) 

We recall the fundamental algebraic properties of the mapping 
A -» A': 

(A + BY = A' + B' y (ABY = B’A 
Also it is clear from (8) and the symmetry of (x y y ) that 



^ 



3. Orthogonal complete reducibility. If © is any subspace 
7 ^ 0 of 9i, (x y y ) is non-degenerate in ©. In fact, (x y x) 0 if 
x 9 ^ 0. It follows from this that the orthogonal complement 
©- 1 - of © is a true complement, that is, 9i = ©® ©^ (cf. p. 151). 
It is easy to determine a basis for ©- 1 . For this purpose one 
needs to have a Cartesian basis (u i, u 2y • • *, u r ) for ©. This can 
be supplemented to a basis {u\ y u 2y • • •, « r ; £ r+i , • * *> <?n) for 9t. 
Then Schmidt’s orthogonalization process yields the Cartesian 
basis («i, ***> «n) 3 and it is clear that ©- 1 - = (& r +i 3 

• • •, «»). 

Suppose now that 12 is an arbitrary set of linear transformations 
in 9? over <£. We shall call 12 orthogonally completely reducible if 
the orthogonal complement © x of any subspace © invariant 
under 12 is also invariant under 12. Since 9? = © ® ©**■, it is clear 
that orthogonal complete reducibility implies ordinary complete 
reducibility. We shall see that this property is enjoyed by many 
important types of linear transformations in 9t. If 12 is orthogo- 
nally completely reducible, we can decompose 9i as a direct sum 
9?i ® 9?2 © * • * © 9J* of subspaces that are invariant and irreduci- 
ble relative to 12 and that are, moreover, mutually orthogonal. 
Thus let 9ii be an irreducible invariant subspace relative to 12. 
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Then $ti ± is invariant and = 9ii © 9?i x . Let 9t 2 be an irre- 
ducible invariant subspace contained in Then 9?i + 9t 2 = 

9ti © 9?2 and these two spaces are orthogonal. Next we may 
write dt = $ 1 ® 9f 2 ® (SRi + 9? 2 )' L - Continuing in this way we 
obtain the required decomposition. 

There is a very useful test for orthogonal complete reducibility 
which is based on the following 

Theorem 3. If 12 is any set of linear transformations and © is 
invariant under 12, then the orthogonal complement © x is invariant 
under 12' the set of transposes of the linear transformations in 12. 

Proof. Let x e © and y e © x . Then xA e © for any A in 12. 
Hence ( xA,y ) = 0 and also (x,yA') = 0. Since this holds for 
all x in ©, yA r e ©•*■. Since y is arbitrary in © x , this shows that 
© x is invariant under 12'. 

We can now state the following important criterion. 

Corollary. A set 12 is orthogonally completely reducible if and 
only if 12 and 12' have the same invariant subspaces . 

4. Symmetric, skew and orthogonal linear transformations. 
Of special interest in Euclidean geometry are the following types 
of linear transformations: Symmetric , defined by the condition 
A' — A y skew defined by A f = —A and orthogonal defined by 
A* = A~ l . These conditions can also be given in terms of the 
associated scalar products. Thus A is symmetric if and only if 
(xA>y) is symmetric. This is readily verified. Similarly A is 
skew if and only if (xA>y) is a skew scalar product in 9 1. If (a) 
is the matrix of A relative to a Cartesian basis (#i, u 2 , • • *, u n )> 
then A is symmetric (skew) if and only if (a) is symmetric (skew). 

In order to see the geometric meaning of orthogonal linear 
transformations we must recall that (u, u) gives the square of 
the length of the vector u. The condition AA r — 1 = A' A im- 
plies that 

(9) (uAy uA) = ( Uy uAA r ) = (#, u). 

Hence A preserves the length of any vector. We now prove the 
converse, namely, if A is any linear transformation that leaves 
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the length of every vector unaltered, then A is orthogonal. For 
if (9) holds for all u , then 



(0 + y)A y (x +y)A ) = (x +y y x +y) 
for all x andy. Expanding and cancelling off equal terms. 



2(x A y yA) = 2{x y y). 



Hence (x y yAA ') = ( x y y) for all x y y. Since (x y y) is non-degen- 
erate this implies that AA f = 1. 

It is clear that A is orthogonal if and only if its matrix (a) 
relative to a Cartesian basis is orthogonal. Another way of stat- 
ing this result is that A is orthogonal if and only if the transform 
( U\A y u 2 A y • • *, u n A ) of a Cartesian basis is a Cartesian basis for 
91. The matrix ( a ) has determinant 1 or determinant —1. In 
the former case ( U\A y u 2 A y • • •, u n A) has the same orientation 
as {u\ y u 2y • • *, & n )- Then A is called a rotation in 9?. 

5. Canonical matrices for symmetric and skew linear trans- 
formations. If © is a subspace invariant under a linear transfor- 
mation A that is symmetric, then clearly © is invariant under A f . 
Hence the set consisting of A alone is orthogonally completely 
reducible. Our discussion of orthogonally completely reducible 
sets suggests the following procedure for obtaining a canonical 
matrix for A . 

Let x be a non-zero vector in 9J and let Mx(X) be its order (see 
p. 67). If 7 r(\) is an irreducible factor (leading coefficient 1) of 
/z x (X) and m*(X) = ttOOK^), then y = xv(A) has the order 7r(X). 
Since $ is the field of real numbers the irreducible polynomial 
7r(X) is either linear or quadratic. We shall now show that the 
symmetry of A assures that 7 r(X) is linear. Otherwise (y y yA) 
is a basis for the cyclic space {jy } , and this basis yields the matrix 



( 10 ) 



0 r 

/3 a. 



where X 2 — a\ — = 7 r(X). On the other hand, if we choose a 

Cartesian basis in {jy } , then we obtain a symmetric matrix 



(ID 



'7 S' 
.5 e. 




180 EUCLIDEAN AND UNITARY SPACES 

for A in {y}. Since (10) and (11) are similar, 

X 2 - a\ - 13 = X 2 - (7 + e)X + (ye - 5 2 ). 

The discriminant of this quadratic is 

(y T - *) 2 ““ 4(ye — S 2 ) = (y — e) 2 + 45 2 > 0 

and this contradicts the irreducibility of 7r(X). Thus we see that 
7 r(X) must be linear so that {y} = [y] is one dimensional and yA 

= py- 

We now replace y by a multiple y i that has length 1 . Then 
yiA = Pijyi, Pi = p. Also 9? = [y d ® [yi] ± and A induces a sym- 
metric transformation in [y i] -1 . Hence we can find a y 2 of length 
1 in [yi] ± such that y 2 A — p 2 y 2 . Next we use the decomposition 
31 = buy 2 ] + buy 2 ^ to obtain a y s of length 1 in \y u jy 2 ]" L such 
that y^A = p 3 jy 3 . We remark that the yi thus obtained are or- 
thogonal in pairs. Hence when we have finished our process, we 
get a Cartesian basis (y 1 , jy 2 > • * y n )» The matrix of A deter- 
mined by this basis is 

( 12 ) diag {pi, p 2 y • • •, p n }. 

If we recall that the passage from one Cartesian basis to another 
is given by an orthogonal matrix, we see that the following theo- 
rem holds. 

Theorem 4. If (a) is a real symmetric matrix , then there exists 
a real orthogonal matrix (a) such that (ii)(a)(<i)~ x is a diagonal 
matrix . 

We can also arrange to have (a) proper. This can be done by 
changing the sign, if necessary, of one of the yi. We note also 
that the p* in the canonical matrix ( 12 ) are the roots of the char- 
acteristic polynomial of this matrix. Hence they are also the 
roots of the characteristic polynomial of (a). We have therefore 
proved incidentally the 

Corollary. The characteristic roots of a real symmetric matrix 
are real . 

We pass now to the theory of a skew linear transformation in 
Euclidean space. Let A be skew and, as above, let y be a vec- 
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tor 7* 0 whose order is an irreducible polynomial ir( X). If jt(X) 
is linear, then yA = py. Hence p(y , y) = (yyf, y) = — (y, yA) 
= ~p{y,y), and this implies that p = 0. If tt(X) is quadratic, 
we choose a Cartesian basis in {y}. We then obtain a skew sym- 
metric matrix 

‘ 0 S' 

.-8 0 . ' 

The polynomial ir(X) is the characteristic polynomial of this ma- 
trix. Hence x(X) = X 2 + 5 2 where 5^0. If we apply the 
method used above for symmetric mappings, we can decompose 
the space as a direct sum of mutually orthogonal spaces i = 
1, 2, • • •, h y such that each ©,■ = {y,}, and the minimum poly- 
nomial ir,(X) ofy,- is either X or it is of the form X 2 + 5< 2 , 6,- 9 ^ 0. 
We can arrange the ©,• so that ir f (X) = X 2 + 5 t - 2 for i = 1, 2, • • •, 
k and tt,(X) = X for i > k. If we choose Cartesian bases in the 
we obtain a Cartesian basis for 91 by stringing these bases 
together. The matrix of A relative to this basis is 



0 Si 

-Si 0 






0 5 2 
-S 2 0 



(13) 



0 S t 

~S & 0 



0 



0J 
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This proves the following 

Theorem 5. If ( a ) is a real skew symmetrix matrix , then there 
exists a real orthogonal matrix (a) such that (a)(a)(o')~ 1 has the 
form (13). 

As for symmetric matrices the canonical form is completely de- 
termined by the characteristic polynomial 

k 

\ n ~ 2k U (x 2 + hi 2 ), ^ ^ o 

1 

of (a). We remark also that the characteristic roots are pure 
imaginaries. 

EXERCISES 

1. If 

■1-2 O' 

( a ) = -2 2 —2 » 

.0-2 3. 

find a proper orthogonal matrix (<r) such that (<r)(a)(<r) — 1 is diagonal. 

2. Prove that, if g(x , y) is a symmetric scalar product in Euclidean space, then 
there exists a Cartesian basis for relative to which the matrix of g is diagonal. 

3. Prove that, if (a) is a real symmetric or skew symmetric matrix of rank p, 
then there is a non-zero p-rowed diagonal minor in (a:). 

6. Commutative symmetric and skew linear transformations. 
If p is a root of the characteristic polynomial /(X) of the sym- 
metric linear transformation, A> then the subspace $Rx- P of vec- 
tors y such that yA = py is ^ 0. We call 9J\_ P the characteristic 
space corresponding to the root p or to the factor X — p of /(X). 
If A is skew and X is a factor of /(X), we define in a similar man- 
ner the characteristic space The other irreducible factors of 
/(X) have the form 7r(X) = X 2 + 5 2 , 5 ^ 0. Corresponding to 
such a factor we define the characteristic space 9?^(X) to be the to- 
tality of vectors y such that yir{A) — 0. As we have seen in the 
preceding section, ^(x> 0. 

If B is any linear transformation which commutes with A , it 
is clear that B maps each characteristic space of A into itself. 
For, evidently, yir(A) = 0 implies that yw(A)B = (jB)tt(A) 
= 0 .* 

* The ideas which occur here have been discussed in a more general form in § 9 of 
Chapter IV. 
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Now let 0 be a set of linear transformations such that 1) any 
two linear transformations of ft commute, 2) every transforma- 
tion of ft is either symmetric or skew. In order to determine 
canonical matrices for the set ft, we consider first the case in 
which ft is an irreducible set. The result that we shall establish 
is the following 

Theorem 6. If a Euclidean space 9? is irreducible relative to a 
commutative set of linear transformations which are either sym- 
metric or skew , then dim 9t < 2. 

Proof. Let 7r(X) be an irreducible factor of the characteristic 
polynomial of any A e ft. Then the characteristic space 9?7r(X) of 
A is invariant relative to ft. Hence 91* (X) = 9t so that 7 t{A) = 0. 
If A is symmetric, then we know that 7r(X) = X — p; hence, in 
this case, A is a scalar multiplication. If A is skew, either 7r(X) 
= X, in which case A = 0, or 7r(X) = X 2 + 5 2 , 8 ^ 0. In the 
latter case A 2 = —8 2 1; hence A ~ 1 exists and is skew. Then if 
B is any other skew transformation in the set, 

M = BA- 1 = A~ X B = (-A-^f-B') = (A- 1 ) / B f = M' 

is symmetric. Since M commutes with every member of ft, the 
argument used before shows that M is a scalar multiplication. 
Hence B = ixA . Thus we see that either the set ft consists of 
scalar multiplications only, or ft consists of scalar multiplications 
and multiples of a single skew A in this set. In the former case 
every subspace of 9i is invariant relative to ft. Hence by the 
irreducibility of 9J, dim 9i = 1. In the second case we can find 
a two-dimensional subspace which is invariant and irreducible 
relative to the chosen skew transformation A . Clearly this space 
is also invariant relative to every member of ft. Hence, here 
dim 9t = 2. This completes the proof. 

Now suppose that ft is an arbitrary commutative set of sym- 
metric or skew linear transformations in a Euclidean space. It 
is clear by the Corollary to Theorem 3 that every set of sym- 
metric and skew linear transformations is orthogonally com- 
pletely reducible. This implies that we may write dt = @i ® © 2 
© • • • © ©a where the are mutually orthogonal and irreducible 
and invariant relative to ft. We can now apply Theorem 6 to 




184 



EUCLIDEAN AND UNITARY SPACES 



conclude that the spaces ©* are either one-dimensional or two- 
dimensional. Moreover, the proof of Theorem 6 shows that the 
symmetric A are scalar multiplications in the ©* and that the 
skew A are multiples of a particular one. If we choose Cartesian 
bases in the © t -, we obtain a Cartesian basis for 9? relative to 
which the matrices of the A e ft all have the form 



(14) 



‘050 

C82) 



(PnV 



where (ft) is either a one- or two-rowed scalar matrix or 



( 15 ) 




€* 

0 . ’ 



In matrix form our result is the following 

Theorem 7. Let co be a set of commutative real matrices which 
are either symmetric or skew symmetric . Then there exists a real 
orthogonal matrix (<r) such that , for each {a) e co, (cr)(a)(o -)“ 1 has 
the form (14). 

EXERCISES 



1. Prove that if A is symmetric, then any two distinct characteristic spaces 
of A are orthogonal. Also show that 91 is a direct sum of the characteristic sub- 
spaces relative to A . 

2. Prove that, if co is a set of commutative real symmetric matrices, then there 
exists a real symmetric matrix (5) such that each (a) e co is a polynomial in (5). 

7. Normal and orthogonal linear transformations. A linear 
transformation A is called normal \i\t commutes with its transpose. 
Special cases of such mappings are the symmetric, skew and 
orthogonal linear transformations. If A is any linear transforma- 
tion, we may write A = \{A + A') + \{A — A f ) = B + C 
where B = \{A + A') is symmetric and C = \{A — A') is 
skew. This decomposition into a symmetric and a skew part is 
unique. For if B + C = Bi + C\ where Bf = Bi and Cf = 
— Ci, then 



(B - B 1 ) = (C x - C). 
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Since B — Bi is symmetric and Ci — C is skew, B — Bi = 0 
= Ci — C. We may therefore call B and C respectively the sym- 
metric and the skew part of A . We note now that A is normal if 
and only if B and C commute. This is clear since B = \(A + A')> 
C = \(A — A ') and A = B + C, A’ — B — C. This remark 
shows that the results on commutative symmetric and skew lin- 
ear transformations can be applied to the study of normal linear 
transformations. Using this method we see that there exists a 
Cartesian basis relative to which B and C have matrices of the 
form (14). Since B is symmetric and C is skew we know that, 
if (fti) is a one-rowed block, then it is 0 for C and if (Pi) is two- 
rowed, then it has the form (15) for C and is scalar for B . The 
matrix of A = B + C is the sum of the matrices of B and of C. 
Hence it has the form (14) where each (Pi) is one-rowed or 



(16) 



m = 



Pi €» 
— *i Pi - 



€i 5* 0 . 

A matrix is called normal if it commutes with its transposed. 
A linear transformation A is normal if and only if its matrix 
relative to a Cartesian basis is normal. Our discussion therefore 
yields the following 



Theorem 8. If (a) is a real normal matrix , there exists an or- 
thogonal matrix (a) such that (a) (a) (a ) -1 has the form (14) where 
each (Pi) is either one-rowed or is a two-rowed matrix of the form 
(16). 

We consider finally the special case of orthogonal transforma- 
tions. Here the one-rowed blocks (Pi) are associated with vec- 
tors yi 7 * 0 such that yiA = Since A does not change the 

length of vectors, Pi = dbl. Also A is orthogonal in the two- 
dimensional subspaces Hence the blocks (16) are orthogonal. 



Pi Ci 




’ Pi — €*•’ 




1 


O' 


. — €i Pi. 




Pi- 




.0 


1 . 



This reduces to the single condition 
(17) Pi 2 + €; 2 = 1. 
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We can determine a number 0» such that cos 0* = pi y sin 0» = e*. 
Then (16) becomes 

’ cos 0i sindi 
l -sin 0i cos 0». 

Theorem 9. Every real orthogonal matrix is conjugate in the 
group of orthogonal matrices to a matrix (14) in which the (ft) are 
either one-rowed matrices (±1) or are of the form (18). 

EXERCISES 

1. Prove that an orthogonal linear transformation is proper or improper ac- 
cording as the multiplicity of the root — 1 of the characteristic polynomial is even 
or odd. 

2. Prove that, if A is a rotation in an odd-dimensional Euclidean space, then 
there exists a vector u 0 such that uA = u . 

3. Prove that every set of orthogonal linear transformations is orthogonally 
completely reducible. 

4. Show that, if 0 is orthogonal and does not have — 1 as a characteristic root, 
then S = (0 — 1)(0 + l) -1 is skew and 0 = (1 + ftCl — ft -1 . Show that, if O 
is orthogonal and 1 is not a characteristic root, then S = (0 + 1)(0 — l) -1 is 
skew and 0 = (S + lXS — l)' 1 . 

8. Semi-definite transformations. A linear transformation A 
is said to be positive definite if it is symmetric and if the asso- 
ciated symmetric bilinear form {xA y y ) is positive definite. This, 
of course, means simply that {xA y x) > 0 for all x 9 ^ 0. It is 
useful also to generalize this notion slightly by defining A to be 
(non-negative) semi-definite if (xA y x) > 0 for all x. It is evi- 
dent from the definition that a positive definite transformation 
is semi-definite and 1-1. The converse holds also. This will 
follow from the following criterion for definiteness and semi- 
definiteness. 

Theorem 10. A symmetric transformation A is positive definite 
{semi-definite) if and only if its characteristic roots are all positive 
0 non-negative ). 

Proof. We know that there exists a Cartesian basis (ji y y 2 , 
• • •, y n ) for such that each jy* is a characteristic vector in the 
sense that yiA = The are the characteristic roots. 

Now (jiA y yi) = (piy iy yi) = Pi(yi y yd has the same sign as p». 
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Hence if A is definite (semi-definite) each p* is positive (non- 
negative). Conversely suppose that p* > 0 (> 0) for all / and 



write x = Then 


/ n 


n > 


( xA , x) = 


( X £*v,vf, 


X hyj 




V i 


1 




( n 


n 


= 


iiPiy if 


X bys 
1 



= Z P^- 2 . 

1 



If all the pi are > 0, this is > 0 unless each £* = 0. Hence pi > 0 
insures definiteness. Also it is clear that, if the pi > 0, then 
(xA, x) > 0. 

If A is 1-1, then 0 is not a characteristic root of A . Hence if 
A is semi-definite and 1-1, all of its characteristic roots are posi- 
tive. Theorem 10 then shows that A is definite. 

If A is an arbitrary linear transformation in Euclidean space, 
then B = AA r is semi-definite since {xB, x) = {xAA r , x) = 
{xA, xA) > 0. We note also that B and A have the same null- 
space; for it is clear that, if zA = 0, then zB = 0. On the other 
hand, if zB = 0, then 0 = (zB, z) = (: zA , zA), and this implies 
that zA = 0. As a consequence of this remark we see that B is 
positive definite if and only if A is 1-1. 

We shall now prove the following useful result: 

Theorem 11. Any semi-definite transformation B has a semi- 
definite square root P ( that is, P 2 = B), and P is unique . 

Proof. The determination of a semi-definite P such that 
P 2 = B is easy. We choose a Cartesian basis (y u y 2 , • • •, y n ) 
such that yiB = p$i and we know that the pi are > 0. Hence 
we can define P to be the linear transformation such that yiP 
= Pi^yi- Clearly P 2 = B. Also since the matrix of P relative 
to a Cartesian basis is symmetric, P is symmetric. Finally 
since the characteristic roots p ^ of P are non-negative, P is semi- 
definite. To prove the uniqueness of P we arrange the charac- 
teristic vectors of B in groups in such a way that the first n\ are 
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those that go with the root pi, the next n 2 are those that go with 
P 2 (t^Pi), etc. Also we introduce the spaces 

9tl = buy 23 * * '>y ni ]y $2 = ^ + 1 )^+ 2 , • • ->y ni +n 2 ]> • • • 

so that 9t = 9Ji © 9?2 © • * * ® 9 ?a (h = number of distinct char- 
acteristic roots). If Ui is any vector in 9t», clearly UiB = p t *«». 
On the other hand, let u = + u 2 + • • • + where e 9J;, be 

any characteristic vector belonging to the root p;, that is, uB = 
piU. Then 

PiU = uB = ( u,\ T - u 2 4“ * * * *T Uh)B = -f- & 2 P -b • • • © u^B 

— P\U\ + P2^2 + • • • + PhUh . 

Hence #y = 0 if j ^ i and u — UiZ 9J*. Thus we see that the 
space 9t* is just the characteristic subspace of 9? corresponding 
to the characteristic root p* of B. As we have seen the charac- 
teristic subspaces of a linear transformation B are invariant with 
respect to any linear transformation C that commutes with B. 
If P is any linear transformation such that P 2 = B y then BP 
= PB and 9t*P c: 9? t *. Thus if P is semi-definite, then P induces 
a semi-definite transformation in each 9t». Now we can find a 
Cartesian basis (wi (0 , w 2 (t) , • • •, w ni (<) ) for 9l»- such that Pwf z) 
= Then Bwf z) = P 2 wf z) = yfwf'K Hence y 2 = p* 

and Yy = p*^. This shows that P coincides with the scalar mul- 
tiplication by pi V2 in the space 9 U. Hence P is the mapping we 
constructed before. 

EXERCISES 

1. Show that any symmetric A whose negative characteristic roots have even 
multiplicities has a square root. 

2. Prove that any symmetric A has a unique symmetric cube root. 

9. Polar factorization of an arbitrary linear transformation. 

Evidently any real number can be written as a product of a non- 
negative real number by one of the numbers 1,-1. This result 
can be generalized to linear transformations in Euclidean space 
as follows 

Theorem 12. Every linear transformation A in Euclidean space 
can be written as a product A — PO where P is semi-definite and 
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0 is orthogonal . P is uniquely determined while 0 is unique if and 
only if A is 1-1. 

Proof. The proof is somewhat simpler in the important case 
in which A is 1-1; hence we consider this case first. We form 
the positive definite transformation B = AA' and its positive 
definite square root P. Set 0 = P~ l A . Then 

00' = P-'AA'P- 1 = P~ l BP~ l = p-iptp-i = 1 

and this shows that 0 is orthogonal. Since A = PO , the exist- 
ence of the representation is established for A 1-1. 

The proof in the general case is basically the same as the fore- 
going. Again we define B = AA\ and we take P to be the semi- 
definite square root of B. Our task is to determine an orthogonal 
transformation 0 such that A = PO. We define first a mapping 
of the space © = into the space ©i = 9 XA by specifying that 
xP — > xA. If xP = yP 9 (x — y)P = 0 and ( x — y)B = 0. 
Since B = AA\ this implies that ( x — y)A = 0 so that xA = 
yA. This shows that our correspondence is single-valued. It is 
now clear that it is linear. Also this mapping preserves lengths 
of vectors, since 



(xA y xA) = {xAA* y x) = ( xBy x) 

and 

(xPy xP) = {xP 2 y x) = (xBy x). 

Now Ay B and P have the same null space; hence they have the 
same rank. It follows that the orthogonal complements ©- 1 and 
have the same dimensionality = h. Now let (u u u 2 , • • Uh) y 
(viy v 2y •••> Vh) be Cartesian bases for these spaces. Then the 
linear transformation of ©- 1 into ©x -1 that sends U{ into Vi pre- 
serves lengths. It follows that the mapping 

0: xP + 'ZaiUi — > xA + XoiiVi 

is an orthogonal transformation. Clearly xPO = xA for all x so 
that A — PO as required. 

If A = POy AA f = P 2 . Hence P is necessarily a square root 
of the semi-definite transformation B = AA f . ^ence P is unique. 
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If A is 1-1, it follows that 0 = P~ X A is also unique. On the 
other hand, if A is not 1-1, then h > 0 in the above notation. In 
this case there are many choices available for the transformation 
of ©^ into ©i* 1 * and these give different determinations of an or- 
thogonal transformation 0. This completes the proof. 

In a similar manner we can establish a factorization A = 0\P\ 
where 0\ is orthogonal and P\ is semi-definite. This can also 
be deduced by applying Theorem 12 to A f . 

EXERCISES 

1. Decompose 

'I -1 2- 

2 1 3 

.1 0 - 5 . 

as a product of a positive definite matrix and an orthogonal matrix. 

2. Show that A = PO is normal if and only if PO = OP. 

10. Unitary geometry. Unitary geometry is the study of a vec- 
tor space over the field of complex numbers relative to a positive 
definite hermitian scalar product (*, y). It is evident at the out- 
set that this geometry will have the essential features of Euclidean 
geometry. On the other hand, we can expect some simplifica- 
tions here due to the fact that the underlying field is algebraically 
closed. Consequently the theory of canonical matrices will be 
simpler than in the real case. It is not necessary to duplicate 
our previous discussion in all detail. In the main we shall be 
content to state the principal results and will give proofs only 
when new methods yield simplifications over the corresponding 
proofs in the Euclidean case. 

We suppose now that $ is the field of complex numbers and 
that (x y y ) is a hermitian scalar product relative to the usual 
mapping a — » a. Then we know that (x y x) is real for any x. 
We shall assume, moreover, that (x y y ) is positive definite in the 
sense that (x y x) > 0 if x ^ 0. This assumption implies that 
there exists a basis (u u u 2y ' y u n ) that is unitary in the sense 
that (uiy uj) = bijy iy j = 1, 2, • • •, n. The passage from one 
unitary basis to another is given by a matrix (cr) that is unitary 
in the sense that 

(*)(*)' = 1 = (*)'(*). 




EUCLIDEAN AND UNITARY SPACES 



191 



These matrices constitute the unitary subgroup £/(<£, n) of £($>, ri) y 
<I> the complex field. The Schmidt orthogonalization process holds 
and this enables us to carry over all of the results of § 1. The 
analogue of Theorem 1 is that, if ((3) is the matrix of a positive 
definite hermitian scalar product , then (13) has the form (v)(v) for 
a suitable triangular matrix (v) with complex elements . The ana- 
logue of Theorem 2 is obtained by replacing the word “real” by 
“complex” and “orthogonal” by “unitary” in its statement. 

The transpose of a linear transformation A is defined as usual. 
If (ui, u 2 , • • *, u n ) is a unitary basis, then the matrix of A' is 
the conjugate transpose (a)' of the matrix of A . Thus A is 
hermitian , A f = A y if and only if (a) is hermitian; A is skew her- 
mitian , A* = — A y if and only if (a) is skew hermitian; A is unitary , 
A r A = 1 = AA* , if and only if (a) is unitary; and A is normal , 
A A' = A’A y if and only if (a) (a)' = (a)' (a). Skew hermitian 
transformations need not be studied as a separate case since their 
theory can be reduced to that of hermitian transformations by 
observing that, if A is skew hermitian, then iA ( i 2 = —1) is her- 
mitian. This follows from the fact that the transpose of the 
scalar multiplication x — » \ix is the scalar multiplication x — > jlx. 
Hence x — > ix is skew hermitian and, since it commutes with 
A , iA is hermitian. 

Suppose now that A is hermitian and let p be a root of the 
characteristic polynomial. Let y be a corresponding characteris- 
tic vector. Then 

p(y>y ) = (py>y) = (y^,y ) = (y,yA) = ( y,py ) = (y,y)p- 

Since (y, y) 0, this implies that p = p is real. We normalize 
y to obtain a multiple jy i such that (ji,yi) = 1. Then also y x A 
= Piyi, P = Pi. If © is any subspace, we denote the orthogonal 
complement consisting of the vectors y' such that (y, y') = 0 
for all y e © by This space is a complement of © and, if © 
is invariant under A , then so is © x . If we apply this remark to 
© = [yi], then we see that 91 = [yi]®|>i] x and that [y^Ac, 
fyi] x . Hence we can find a real number p 2 and a vector y 2 in 
[jVi] J ' such that y 2 A — p 2 y 2 . Next we write 9t = [ yi,y 2 ] © 
[y x, y 2 \’ L > and we repeat the argument with [y u y 2 ] L . This leads 




192 



EUCLIDEAN AND UNITARY SPACES 



finally to a unitary basis relative to which the matrix of A is 
diag {pi, p 2y • y p n } 



where the pi are real. 

Theorem 13. If ( a ) is a complex hermitian matrix , there exists 
a unitary matrix (<r) such that (a) (a) (a) ~ 1 is a real diagonal matrix . 

Evidently a similar result holds also for skew-hermitian matrices. 
As in the Euclidean case we may prove next that, if co is a set of 
commutative hermitian and skew-hermitian matrices, then there 
exists a single unitary matrix (a) such that every (c) (a) (<r) ~ 1 is 
diagonal for every a in co. This leads as before to Theorem 13, 
with the word “real” omitted, for normal matrices and conse- 
quently also for unitary matrices. For the latter the character- 
istic roots are of absolute value 1. This follows from 

I P | 2 ( y,y) = iyd,yA) = (y, y) 

if yA = py . The principal theorem on unitary matrices can 
also be derived directly by using the same argument that we 
used in the hermitian case. The important remark is that, if © 
is a subspace invariant under a unitary transformation, then the 
orthogonal complement ©- 1 is also invariant. 

If A is a hermitian linear transformation, the associated bi- 
linear form (xA,y) is hermitian; for 

(yA, x) = (y, xA) = (xA,y). 

It follows that (xA, x) is real for any x. Now as in the Euclidean 
case we define A to be positive definite (semi-definite) if (xA, x) 
> 0 ( > 0) for all x 0. The discussion that we gave in the 
Euclidean case can be carried over without change. Thus we 
can prove that any semi-definite transformation has a unique 
square root. As before, this can be used to establish the polar 
factorization. Every linear transformation in unitary space can be 
written as A = PU where P is semi-definite and U is unitary . P 
is unique and U is unique if and only if A is 1-1. 
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We shall prove now another theorem that is applicable to arbi- 
trary linear transformations and matrices in a unitary space. 
This is the following 

Theorem 14. If (a) is a matrix with complex elements , then 
there exists a unitary matrix (<r) such that (cr) (a) (a) _1 is triangular . 



Proof. In order to prove this we let A be the linear transfor- 
mation whose matrix relative to some unitary basis v 2 > •• *, 
v n ) is the given matrix (a). If pi is a characteristic root, there 
exists a vector y i such that (y i, yi) = 1 and y±A = pijyi. We 
can find a unitary basis (y i, jy 2 > * * y n ) that includes the vector 
yi. Then, since y±A = piyi, the matrix of A relative to this 
basis is 

[Pi 0 •■■0 1 



(19) 



05) 



P2 1 P22 



P2n 



LPnl Pn2 * * * Pnn-J 



If (ju) is the matrix of (j u y 2 , • • - ,y n ) relative to (v 1} v 2 , • ■ v n ), 
then (ju) is unitary and (/r) (a) (n ) ~ x is the matrix (/3) of (19). We 
may now assume that there exists a unitary matrix ( v ) of n — 1 
rows and columns such that 









P2 


0 


P22 


P2n 








• 




W" 1 = 






Pn 2 


Pnn- 




* 


Pn- 



Then the matrix (r) 



i o • 
-0 «. 



is unitary and 



fPx 



WWW' 1 = (<r)(«)(<r)- x = 



P2 



o 1 



PnJ 



where (a) = (t)(ju) is unitary. 
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EXERCISES 

1. Show that Theorem 14 holds for any set co of commutative complex ma- 
trices. 

2. What is the analogue of Theorem 14 in the real case? 

3. Show that a triangular matrix is normal if and only if it is diagonal. Use 
this to prove the theorem on canonical forms for normal matrices with complex 
elements. 

11. Analytic functions of linear transformations. In treating 
analytic questions on matrices we shall take as our point of de- 
parture the notion of convergence of sequences of matrices. If 
{ (a (A) )}, & = 1, 2, 3, • • •, is an infinite sequence of matrices in 
<£ n , the field of complex numbers, we say that { (a (/b) )} converges 
to (a), ( a (k) ) — » (a), if the sequence of complex numbers aij {k) — » 
ocij for every /, j = 1 , 2, • • •, n. The limit matrix (a) is unique 
since this is the case for sequences of complex numbers. Using 
our definition of convergence it is clear that addition and multi- 
plication of matrices are continuous functions, that is, if (a (/b) ) 
— » ( a ) and (/3 (/b) ) — > (j8), then 



(20) 


(« w ) + (/? w ) 


- («) + C8) 


(21) 


(a w )G8<*>) 


(<*)(£)• 



To convince ourselves of the validity of these basic rules we have 
only to observe that the (i>j) element of the sum (product) of 
two matrices is a continuous function of the 2 n 2 coordinates 
oiij , fiij. An important special case of (21) is that, if {a {k) ) — > ( a ) 
and (ju) is non-singular, then 

(22) (M)(a W )(M) _1 (m)(«)(m) _1 - 

The last result can be used to define convergence for linear 
transformations. Let k = 1, 2, • • •, be an infinite sequence 

of linear transformations in 9? over $ and let (a (A;) ) be the matrix 
of A k relative to a basis (e^ e 2y • * *, e n ) of 9?. We shall say that 
{Ak} converges to the linear transformation A y A k — > A if 
(* w ) - ( a) where (a) is the matrix of A. Because of (22) it 
is clear that the condition A k — > A is independent of the choice 
of basis in 9i. 




